TRANSFER IN MOTOR LEARNING AS A FUNCTION OF DEGREE
OF FIRST-TASK LEARNING AND INTER-TASK SIMILARITY 1


CARL P. DUNCAN 


Northwestern University 


The purpose of this study was to 
investigate transfer when both the 
first and second tasks consisted of dif- 
ferent patterns of S-R connections on 
the same apparatus. This type of 
transfer situation has not received 
much study, although it occurs fre- 
quently. Equipment requiring the 
operator to learn a fairly complex co- 
ordination, e.g., automobiles, air- 
planes, is constantly being modified. 
Such modifications often require the 
operator to learn a new S-R pattern. 
The present study is an attempt to 
provide data on transfer when one has 
to relearn operation of the same device 
because S-R relationships have been
changed. 

A brief description of the apparatus 
may help clarify the transfer situation 
employed. The S responded to col- 
ored-light stimuli by moving a lever, 
grasped by the right hand, into a slot. 


1 This is the first of a series of papers on trans- 
fer in motor learning. The research was per- 
formed under Air Force Contract No. 33(038)- 
11396 between the Aero Medical Laboratory of 
the Wright Air Development Center, Air Re- 
search and Development Command, and North- 
western University. Thanks are due to Dr. 
B. J. Underwood, Assistant Contract Director, 
and to Dr. V. E. Montgomery, Chief Research 
Assistant. 


There were six slots, arranged as radii 
of a circle, and six lights. Learning 
the light associated with each slot was 
made difficult by requiring S to hold 
another lever, with the left hand, in a 
fixed position. One pattern of light- 
slot pairs defined Task I; Task II was 
provided by pairing the lights with 
different slots. Different degrees of 
inter-task similarity were obtained by 
varying the number of lights paired 
with new slots on the second task. 


Few studies relate to the present approach. 
The most similar work in verbal learning appears 
to be the case in which the stimuli of two lists 
learned successively were identical and responses 
similar. This condition resulted in positive 
transfer (6). 

Perhaps more relevant to the present study is 
the work of Lewis, McAllister, and Adams (3), 
where transfer was studied by reversing arrange- 
ment of controls on the Mashburn apparatus 
following practice on standard arrangement. On 
the reversed task S had to move controls in the
direction opposite to that on the standard task. 
Some interference, in terms of errors on the first 
trial of reversed-task practice, was found, but 
over-all transfer was positive. There was some 


evidence that both interference and positive 
transfer increased with degree of first-task learn- 
ing, but the relationship was not clear-cut. 

The method in the present study of defining 
similarity between tasks in terms of number of 
S-R connections changed from Task I to Task II 
appears similar to the procedure used by Crafts 
Fig. 1. The apparatus, partly schematic


(1), in an experiment on card sorting. Cards 
were sorted in nine boxes and Task II was pro- 
vided by changing, for different groups of Ss, 
positions of 0 (control), 3, 6, or all 9 boxes.
Sorting time on Trial 1 of Task II increased 
directly with number of boxes with changed posi- 
tions. It is likely, extrapolating from the data 
Crafts presented, that transfer would have been 
positive if more trials than the two reported had 
been given on the final task. 


The above studies suggest we may 
expect to find positive transfer in the 
present experiment and to find it in- 
creasing directly both with degree of 
learning of Task I and similarity be- 
tween tasks. 


PROCEDURE 


Apparatus.—Figure 1 shows a partly sche- 
matic drawing of the apparatus. The figure 
shows the stimulus unit, the response unit, and 
an enlargement of one of the slots. The elec- 


trical connections and the recording unit are not 
shown.2

Two levers of }-in. steel rod projected ver- 
tically 8} in. above the top surface of a box 37 in. 
long, 16 in. wide, and 8} in. deep. The levers, 
topped by knobs, were 21 in. apart. The right- 
hand lever was centered in a circular opening, 
1} in. in diameter, cut in the center of a steel 
plate, 16 in. square, that formed part of the top 
surface of the box. Six radial slots, 3} in. long, 
.26 in. wide, and spaced 60° apart, led from the 
perimeter of the 14-in. hole in the plate. The 
lower end of the right-hand lever was connected 
to a ball-and-socket joint, permitting movement 
of the lever into any slot. 

Each slot was equipped with three micro- 
switches. The first was } in., the second 2 in., 
from the entrance of the slot. The third was at 
the far end of the slot. With the lever in an 
incorrect slot a shallow error was recorded by 
tripping the first switch; tripping the second 
switch recorded a deep error. Depressing the 
third switch recorded a correct response. With 
one exception (see below) the error circuits were 


2 Wiring diagram of the apparatus is available
on request. 
cut out for any slot whenever it was correct for
the stimulus light showing. Correct responses, 
shallow errors, and deep errors were recorded on 
separate counters. 

The left-hand lever was supported below the 
surface of the box by a rubber pad, so it could be 
moved in three dimensions. Attached to the 
lower end of left lever was a small flexible spring 
projecting into a brass cup. Movements of the 
top of the left lever 3 in. or more in any direction 
except straight up touched the spring to the cup, 
breaking a circuit to a .01-sec. Standard Electric
clock. The left lever could not be held steady 
by lifting vertically because the knob, fitted on 
by a sleeve, would come off. Thus the left lever 
provided essentially a steadiness task. The clock 
recorded duration of time the lever was held 
correctly; if held incorrectly, a circuit was com- 
pleted to an oscillator generating a 1000-cycle 
tone. The tone, delivered to S through ear- 
phones, informed him to correct the lever’s posi- 
tion. The two levers were so connected that 
correct responses were not recorded if the left 
lever was out of position. At such times entry 
of the right lever into a correct slot yielded a 
deep error and the stimulus light would not 
change. 

The box containing the levers rested on a 
table the height of which was adjustable by 
means of pulleys and counterweights. 

Light sources for stimuli were 50-w. projection 
bulbs arranged in a semicircle so the light from 
each was projected onto the rear surface of a 
single ground-glass screen. The light from each 
bulb passed through a different gelatin filter to 
provide the six hues. Filters were made for the 
purpose and transmitted a fairly wide band of 
wave lengths. Although brightnesses and satu- 
rations undoubtedly varied, the hues could easily 
be differentiated. 

Projection bulbs were contained in a box 18 
in. long, 12 in. wide, and 9 in. high. Viewed by 
S, the stimulus box presented a black surface in 
the upper center of which was a circular opening, 
23 in. in diameter, covered by the ground-glass 
screen. The screen was 30 in. from S’s eyes. 
Stimulus hues were seen by S as diffuse color 
filling the screen. 

Each of the six stimuli appeared eight times 
in a randomized sequence of 48 lights controlled 
by a rotary switch with 48 positions. The 
sequence repeated itself continuously. The same 
light never appeared twice in succession. Some 
Ss went through the sequence many times but 
there was no evidence of memory for serial order.

A panel of six telephone jacks permitted E to 
connect any light with any slot, so a new task 
could be provided for S in a few seconds. A 
microswitch riding on a cam driven by a 1-rpm
motor automatically provided 20-sec. trials with 


10-sec. intertrial rests. The apparatus could 
also be set for continuous run. 
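The stimulus sequence described above (six lights, eight appearances each, 48 positions, no light twice in succession) can be sketched in code. This is an illustration only, not the method used to build the rotary switch; the function name `make_stimulus_sequence` and its parameters are hypothetical. Because the sequence repeated continuously, the sketch assumes that the last light must also differ from the first.

```python
import random


def make_stimulus_sequence(n_lights=6, repeats=8, seed=0):
    """Return a cyclic sequence in which each light appears `repeats` times
    and no light follows itself, including across the wrap-around.

    Hypothetical helper; the wrap-around rule is an assumption drawn from
    the statement that the sequence repeated itself continuously.
    """
    rng = random.Random(seed)
    pool = [light for light in range(n_lights) for _ in range(repeats)]
    while True:
        # Rejection sampling: reshuffle until no neighbours are identical.
        rng.shuffle(pool)
        # pool[i - 1] with i == 0 compares the first item with the last one.
        if all(pool[i] != pool[i - 1] for i in range(len(pool))):
            return list(pool)


sequence = make_stimulus_sequence()
print(len(sequence), sequence[:12])
```

Rejection sampling is wasteful but adequate for a sequence of this length; a constructive method would be preferable for much longer sequences.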

Experimental design.—The two variables stud- 
ied were degree of learning of Task I and simi- 
larity between tasks. Task I was learned for 
10, 40, 80, or 180 trials and there were three 
degrees of inter-task similarity. All 12 combi- 
nations of these conditions were studied in a 
factorial design with 25 Ss in each of the 12 cells. 
Notation that is used throughout the paper is 
shown in Table 1; roman numerals indicate 
Learning Groups (75 Ss each), capital letters 
indicate Similarity Groups (100 Ss each). 
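As a check on the notation, the factorial layout can be enumerated in a few lines. The sketch below only restates the numbers given in the text (four Learning Groups, three Similarity Groups, 25 Ss per cell); the variable names are illustrative and not part of the original report.

```python
from itertools import product

# Task I trials for each Learning Group, as stated in the text.
LEARNING = {"I": 10, "II": 40, "III": 80, "IV": 180}
# Number of lights paired with new slots on Task II for each Similarity Group.
SIMILARITY = {"A": 2, "B": 4, "C": 6}
SS_PER_CELL = 25

# One entry per cell of the 4 x 3 design.
cells = {(lg, sg): SS_PER_CELL for lg, sg in product(LEARNING, SIMILARITY)}

total_ss = sum(cells.values())
per_learning = {lg: sum(n for (l, _), n in cells.items() if l == lg)
                for lg in LEARNING}
per_similarity = {sg: sum(n for (_, s), n in cells.items() if s == sg)
                  for sg in SIMILARITY}

print(len(cells), total_ss, per_learning, per_similarity)
```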

Table 1 shows number of trials given each 
Learning Group on Task I. All Ss were given 
60 trials on Task II. For all Ss 2 min. elapsed 
between the last trial of Task I and the first trial 
of Task II. 

Distribution of the experiment over days was 
as follows: Group I, all trials on both tasks on the 
same day; Group II, Trials 1-35 of Task I on 
one day, Trials 36-40 of Task I and all trials on 
Task II on the succeeding day; Group III, Task I 
Trials 1-40 on Day 1, 41-75 on Day 2, and 76- 
80 and Task II on Day 3; Group IV, Task 
I Trials 1-60 on Day 1, 61-120 on Day 2, 121-175 
on Day 3, and 176-180 and Task II on Day 4. 
Note that in Groups II, III, and IV the last five 
trials of Task I were given immediately preceding 
the start of practice on Task II to eliminate 
effects of forgetting over the last 24-hr. rest, in 
this respect making these groups comparable to 
Group I where there was no 24-hr. rest. 

The tasks.—Task I was defined as a particular 
pairing of lights with slots. Task II was a dif- 
ferent pairing of lights and slots. Inter-task 
similarity was defined in terms of the number, 
out of a possible six, of light-slot connections
that were different on Task II from what they had 
been in Task I. In Group A (high similarity) 
two lights were connected to different slots for 
Task II; the other four lights were paired with 
the same slots throughout both tasks. Group B 
(medium similarity) had four lights paired with 
different slots in Task II; Group C (low simi- 
larity) had all six lights and slots newly paired in 
Task II. 
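The definition of inter-task similarity can be made concrete with a short sketch. Since each light is paired with exactly one slot, changing k connections amounts to permuting the slots of k chosen lights so that none keeps its old slot. The function `make_task2` is hypothetical and chooses the changed lights at random; it is not the procedure by which the actual Task II combinations were selected.

```python
import random


def make_task2(task1, n_changed, seed=0):
    """Return a Task II pairing that differs from `task1` in exactly
    `n_changed` light-slot connections.

    `task1[light]` is the slot paired with that light. Hypothetical helper
    for illustration; a single connection cannot be changed alone.
    """
    if n_changed == 1 or not 0 <= n_changed <= len(task1):
        raise ValueError("n_changed must be 0 or between 2 and len(task1)")
    rng = random.Random(seed)
    lights = rng.sample(range(len(task1)), n_changed)
    old_slots = [task1[light] for light in lights]
    while True:
        new_slots = old_slots[:]
        rng.shuffle(new_slots)
        # Accept only if every chosen light lands in a different slot.
        if all(a != b for a, b in zip(old_slots, new_slots)):
            break
    task2 = list(task1)
    for light, slot in zip(lights, new_slots):
        task2[light] = slot
    return task2


task1 = [0, 1, 2, 3, 4, 5]            # light i paired with slot i
for group, k in (("A", 2), ("B", 4), ("C", 6)):
    print(group, make_task2(task1, k))
```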














TABLE 1
DESIGN OF THE EXPERIMENT

                          Inter-task Similarity
Degree of
Task I Learning      A (High)   B (Medium)   C (Low)
I (10 trials)           25          25          25
II (40 trials)          25          25          25
III (80 trials)         25          25          25
IV (180 trials)         25          25          25

















Although it was assumed that all 720 combi- 
nations of lights with slots would be equally 
difficult to learn, only 25 combinations, chosen so 
each light was connected to each slot an approxi- 
mately equal number of times, were used for 
Task I. Each S within a subgroup of 25 Ss 
was assigned a different combination before prac- 
tice began on Task I. 

For each combination used in Task I, three 
other combinations were made up for use in 
Task II. One of these had high similarity, one 
medium, and one low. In choosing the Task II 
combinations for each Task I combination, 
wherever possible one of the other 24 Task I 
combinations was used. Thus, the Task II com- 
bination for any one S was often one used by a 
different S as Task I. We shall therefore assume
that the tasks were not only equal in difficulty 
but were in a sense identical. Because of this 
assumption, separate control groups were not 
run. Instead, data from Task I were used to 
construct control curves for learning Task II. 

Controls—The control curve for learning 
Task II was constructed from data of Groups III 
and IV on Task I. These were the only groups 
given at least 60 trials on Task I, the number 
given on Task II. 

Subjects and method.—The Ss were 300 males 
from introductory psychology classes at North- 
western University. They were assigned ran- 
domly to the 12 subgroups. They were rewarded 
with one point of examination credit for each 
day served and were told the experiment was 
being done for the Air Force; as a result, they 
appeared to be highly motivated. 

When Ss first appeared for the experiment, 
they were asked to name the six hues as E pre- 
sented them in the stimulus window. This was 
continued until E was satisfied S could easily 
discriminate the lights. Upon questioning, a 
few Ss said they were color-blind, but if they 
could name the hues, they were used. One S 
was rejected for inability to discriminate the 
lights. 

Following hue naming, S took his place in 
front of the apparatus with hands on levers. 
Height of the table was adjusted until knobs of 
the levers were approximately 1 in. below S’s 
elbow height. Since S stood up for all practice 
trials, this height seemed to be least fatiguing. 
The S was then instructed on the apparatus and 
what he had to do. In pilot experiments lengthy
instructions had been developed, of which the 
gist was as follows: S was told the experiment 
concerned transfer and division of attention, the 
former explained with reference to different 
makes of cars, the latter by showing S he could 
not turn off lights unless the left lever was held 
correctly. The S put on earphones and listened 
to the oscillator tone. He was told to learn, by 
moving the right lever into slots, which light was
connected to which slot, and to turn off as many 
lights as possible during each 20-sec. trial. He 
was further told that during the 10-sec. intertrial 
rest he would hear how many correct responses 
he had made in the previous trial. 

Between trials, S remained in position at the 
apparatus with hands on levers and earphones 
on. After every block of 20 trials on the same 
task a 3-min. rest was given during which S took 
off earphones, sat down, and was permitted to 
smoke and talk. 

When S returned on subsequent days, he was 
told the lights and slots were paired the same as 
before. Upon completion of the required num- 
ber of trials on Task I, S was given a 2-min. rest 
while light-slot connections were changed for his 
Task II. The S was then instructed that 2, 4, 
or all 6 lights (depending upon the similarity con- 
dition) were now connected to different slots and 
practice was started on Task II. 


RESULTS 


Comparability of groups on Task I.— 
Before we can present data on the 
comparability of groups in perform- 
ance on Task I, it is necessary to show 
that the measures are reliable. A reli- 
ability coefficient based on all 300 Ss 
was computed from the total correct 
responses on the first five odd and first 
five even trials. The coefficient was 
.790, corrected to .883. 

Another coefficient was computed 
from the total of the first 20 odd and 
first 20 even trials, where the available 
N was 225. The uncorrected coeffi- 
cient was .965. Since most of the find- 
ings are based upon correct responses 
as the measure, reliability coefficients 
were not computed for other response 
measures. The coefficients obtained 
indicate sufficient reliability to permit 
group comparisons. 
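The text does not name the correction applied to the first coefficient. The Spearman-Brown formula for a test of doubled length reproduces the reported figure, so the sketch below assumes that formula; the function name `spearman_brown` is illustrative.

```python
def spearman_brown(r_half, factor=2.0):
    """Step up a split-half correlation to the reliability of a test
    `factor` times as long (Spearman-Brown; assumed, not stated in the text)."""
    return factor * r_half / (1.0 + (factor - 1.0) * r_half)


corrected = spearman_brown(0.790)
print(round(corrected, 3))
```

With the odd-even coefficient of .790 the formula gives 1.58 / 1.79, which rounds to the .883 reported above.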

Since we are concerned with trans- 
fer, data for acquisition of Task I are 
not presented in detail. To determine 
if random assignment of Ss to sub- 
groups resulted in equal performance 
on Task I simple analysis of variance 
was performed on total correct re- 
sponses of the first ten trials, which 
TABLE 2
MEAN SCORES ON THE LAST TWO TRIALS OF TASK I

                 Correct Responses     Steadiness Time (Sec.)
Group             Mean      σM           Mean      σM
I                 4.63     .237          9.67     .479
II               10.54     .416         12.32     .472
III              13.20     .344         13.87     .369
IV               15.63     .387         14.52     .403
Grand mean       11.00     .294         12.59     .224

















permits comparison of all 12 sub- 
groups. It was first demonstrated 
that the 12 variances were homogene- 
ous; Bartlett’s test resulted in a chi 
square of 1.92 with 11 df. Since the 
variance within groups (149.62) was 
larger than that between groups 
(108.06), the subgroups appear to be 
adequately matched. 


The three subgroups within each 
Learning Group were also tested for 
equality of performance by four sep- 
arate analyses of variance. For each 
analysis the score was total correct 
responses for all trials given that par- 
ticular Learning Group on Task I. 
The F’s were less than one in Groups 
I, II, and III. In Group IV F was
2.69, with 2 and 72 df. Since F at the 
5% level for 2 and 80 df is 3.11, the 
subgroups within each Learning Group 
can be considered equal in perform- 
ance on Task I. 

Table 2 shows mean correct re- 
sponses per trial and mean steadiness 
time per trial with the left lever for 
each Learning Group at the end of 
practice on Task I. Values in the 
table are based on the mean of the 
last two trials of Task I practice for 
each group. The values are used to 
measure gains or losses in transferring
to Task II and will be shown on graphs 
depicting acquisition of Task II. 

Transfer: Correct Responses—Ac- 
quisition of Task II is shown in Fig. 2 
and 3. In Fig. 2 the three Similarity 
Groups have been combined for each 
of the four Learning Groups to show 
the effect on Task II of different de- 
grees of learning of Task I when inter- 
task similarity is held constant. In 
Fig. 3 Learning Groups have been 
combined to show effect of varying 
inter-task similarity. The control 
curve is shown in both figures, and 
both include heavy points, at the left 
of the abscissa break, representing 
level of mastery at the end of Task I. 
These heavy points were taken from 
Table 2. 

Let us first inspect the control curve 
in Fig. 2 and 3. It will be recalled 
that this is the curve for the first 60 
trials of Groups III and IV on Task I. 


[Fig. 3. Caption partly lost; surviving text: "... of inter-task similarity with Task I learning constant." Abscissa label ends "... of trials."]


The curve shows a sudden rise from 
the eleventh to twelfth abscissa points 
(Trials 21-22 and 23-24), and shows 
both a drop and a rise from the twen- 
tieth to twenty-second abscissa points. 
The drop is due to forgetting over the 
24-hr. rest given Group III between 
Trials 40 and 41. The rises are prob- 
ably indirectly due to rests; it will be 
recalled that in addition to 24-hr. rests 
a 3-min. rest was given after each 
block of 20 trials. Although we shall 
not present the data, there was some 
evidence that these rests had a com- 
plex effect. Occasionally there was 
some forgetting, or loss of set, indi- 
cated by a slight drop in score on the 
trial immediately following rest. How- 
ever, on the second trial after rest 
there frequently was a gain (reminis- 
cence?) larger than on any previous 
trial. 

Referring to the curves for the ex- 
perimental groups, both Fig. 2 and 3 








show there is considerable initial decrement,
indicated by the drop from
end of Task I to beginning of Task II. 
A clearer picture of initial decrement 
may be obtained from Fig. 4, where 
initial loss in correct responses on 
Task II is plotted as a function of 
trials on Task I. The three curves in 
the figure represent similarity condi- 
tions. Values of points on the curves 
were computed by subtracting the 
mean of last two trials on Task I from 
the mean of first two trials on Task II; 
thus, a loss is shown by a minus value. 
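The decrement measure just described is a simple difference of two means. A minimal sketch follows; the function name `initial_decrement` and the example scores are hypothetical and are not data from the experiment.

```python
def initial_decrement(task1_scores, task2_scores, k=2):
    """Mean of the first k Task II trials minus the mean of the last k
    Task I trials; a negative value indicates a loss."""
    last_task1 = task1_scores[-k:]
    first_task2 = task2_scores[:k]
    return sum(first_task2) / k - sum(last_task1) / k


# Hypothetical correct-response counts per trial for one S.
task1 = [10, 12, 14, 16]
task2 = [9, 11, 13]
print(initial_decrement(task1, task2))
```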

Figure 4 shows that initial loss is a 
negatively accelerated increasing func- 
tion of Task I learning. Differences 
among the curves show decrement 
increased as inter-task similarity de- 
creased. It is clear that initial per- 
formance on Task II is a function of 
both variables under consideration. 

Returning to Fig. 2 and 3, we see 
that, regardless of initial decrement, 
there is no interference, at least in 
terms of correct responses; no curve 
falls below the control curve except 
Group I for the last four trials. The 
over-all transfer effect is clearly posi- 
tive. No test of significance appears 
necessary; it can be concluded that all 
experimental groups showed positive 
transfer. 
Fig. 4. Initial decrement on Task II measured as the difference between
mean of last two trials on Task I and mean of first two trials on Task II


TABLE 3
ANALYSIS OF VARIANCE OF TOTAL CORRECT RESPONSES FOR ALL 60 TRIALS ON TASK II

Source             df     Mean Square       F
Learning            3      693,260.89     39.54*
Similarity          2      123,939.03      7.06**
L × S               6        7,786.30
Within groups     288       17,532.52

* F = 3.88 at the 1% level, 2.65 at the 5% level, for 3 and 200 df.
** F = 4.71 at the 1% level, 3.04 at the 5% level, for 2 and 200 df.



It cannot be determined from Fig. 2 
and 3 whether or not the two variables 
interact. To assay this, double classi- 
fication analysis of variance was per- 
formed on total correct responses for 
all 60 trials of Task II. First it was 
determined that the hypothesis of 
homogeneous variance among the 12 
subgroups could not be rejected; chi 
square by Bartlett’s test was 7.56. 

The analysis of variance is summarized
in Table 3. It can be seen
that learning and similarity do not 
interact significantly. Using variance 
within groups as error, the F’s show 
that both learning and similarity are 
highly significant. 
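The F ratios of Table 3 follow from dividing each mean square by the within-groups mean square. The sketch below repeats that arithmetic using the tabled values; the variable names are illustrative. The similarity ratio computes to about 7.07, so the tabled 7.06 appears to have been truncated rather than rounded.

```python
# Mean squares as printed in Table 3.
MEAN_SQUARES = {
    "learning": 693260.89,
    "similarity": 123939.03,
    "interaction": 7786.30,
}
MS_WITHIN = 17532.52  # error term, 288 df

# Each F is the effect mean square over the within-groups mean square.
f_ratios = {source: ms / MS_WITHIN for source, ms in MEAN_SQUARES.items()}

for source, f in f_ratios.items():
    print(f"{source:12s} F = {f:.2f}")
```

The interaction mean square is smaller than the error term, so its ratio is below one and needs no test.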


The analysis just presented utilized all 60 
trials, so it tends to mask possible changes in 
effectiveness of the variables over practice. Ex- 
amination of Fig. 2 and 3 indicates that differ- 
ential mastery of Task I may have been effective 
throughout all trials on Task II, whereas inter- 
task similarity does not appear differentially 
effective after Trial 30. The possibility that 
learning had an effect that lasted later in practice 
than the effect of similarity was tested by sep- 
arate analyses of variance, one on total correct 
responses for the first 30 trials, and one on the 
last 30 trials, of Task II. This division of trials 
into first and last 30 has no theoretical basis; it 
was chosen arbitrarily from inspection of Fig. 2 
and 3. 

It was first demonstrated that variances of 
the 12 subgroups on the first 30 trials were 
sufficiently homogeneous to permit analysis; chi 
square was 4.59. It was assumed without test- 
ing that homogeneity of variance obtained for 
the last 30 trials. 








We shall not present summary tables for these 
analyses. For both, the df, and thus F’s required 
for significance, are those given in Table 3. 
Analysis of the first 30 trials showed learning 
highly significant (F = 61.11), similarity highly 
significant (F = 17.32), and interaction not sig- 
nificant. For the last 30 trials learning was 
highly significant (F = 18.31), similarity not 
significant (F = 1.11), and interaction not sig- 
nificant. Interaction variance was less than 
within-groups variance in both analyses. These 
analyses therefore demonstrate that differential 
positive transfer produced by varying Task I 
learning extended later in practice than that 
resulting from varying similarity. 


Transfer: Errors.—It will be recalled 
that deep errors were recorded both 
when right lever traversed completely 
an incorrect slot and when the slot 
was correct but left lever was out of 
position. Because the two types of 
error were not measured separately, 
the deep-error count turned out to be 
a rather gross and not particularly 
helpful measure. The shallow-error 
count better represented incorrect
responses, since it was based only on 
partial entries of right lever into incor- 
rect slots. Separate analyses were 
made of both error counts and of total 
error score obtained by combining 
shallow and deep errors. Results of 
these analyses will not be presented 
in detail because they yielded very 
little information beyond that derived 
from analysis of correct responses. 
For example, error curves for Task II 
also showed positive transfer; all 
experimental group curves were lower 
(showed fewer errors) than the control 
curve until the last ten trials (Trials 
51-60), where all curves began to over- 
lap. There were clear differences both 
among the three curves for Similarity 
Groups and the four curves for Learn- 
ing Groups for about the first 30-40 
trials on Task II, after which there 
were little or no differences in errors 
produced by various degrees of either
variable. Finally, there was almost 
no evidence of interference in terms of 
errors. As in the case of correct 
responses, all groups showed consider- 
able initial decrement (increase in 
errors) from the end of Task I to the 
beginning of Task II, but only Learn- 
ing Group I and Similarity Group C 
made more errors than the control 
group, and this only on first two trials 
of Task II. 

Transfer: Time scores.—The S’s task 
with the left lever remained the same 
throughout all phases of the experi- 
ment; viz., to hold the lever steady. 
The effect this might have on transfer 
will be discussed later. It does not 
seem necessary, however, to present a 
detailed analysis of the time score, i.e., 
the time per trial the left lever was 
held in correct position. The one 
graph presented, Fig. 5, shows time in 
seconds per trial on Task II for the 
Learning Groups. Similarity Groups 
are combined within each Learning 
Group. The graph also includes the 
control curve and the heavy points, 
taken from Table 2, showing the level 
reached at end of Task I. Figure 5 
shows that there was some loss in time 
score, at least in Groups II, III, and 
IV, at start of practice on Task II; 
however, the decrement was regained 
by the fourth or fifth trial. Thus the 
decrement demonstrated by using cor- 
rect responses (and, in fact, errors) as 
measures, since it lasted much later in 
practice than the fifth trial, was not 
due merely to loss of skill with the 
left lever. It may also be seen in Fig. 
5 that Groups I and II, and perhaps 
Group III, continued to improve in 
steadiness throughout practice on 
Task II.


DISCUSSION


The most striking result of the ex- 
periment was the large amount of 


positive transfer obtained. Every 
degree of first-task learning facilitated 
acquisition of Task II, and facilitation 
occurred for every variation of inter- 
task similarity. These results corrob- 
orate those of Lewis, McAllister, and 
Adams (3); when stimuli remain the 
same from task to task and direction 
of movement involved in the responses 
is changed, positive transfer results. 

It is believed that two factors are 
responsible for the positive transfer 
obtained here: response generalization 
and learning-how-to-learn. Concern- 
ing response generalization, Under- 
wood (6) has argued that this factor 
is responsible for the positive transfer 
typically found in learning a second 
paired-associate verbal list whose 
stimuli are identical with, and re- 
sponses similar to, those of the first 
list. Since he presented the theory 
in some detail, it is only necessary to 
summarize it here. The basic assump- 
tion is that reinforcement of a response 
during learning the first task or list si- 
multaneously results in some strength- 
ening of similar (generalized) responses. 
A gradient of response generalization 
is also assumed; the more similar 
responses are to the reinforced response 
the more they are strengthened. Thus, 
as a result of “parasitic” reinforce- 
ment of generalized responses during 
practice on the first list, there will be 
some tendency to make responses of 
the second list even before practice 
begins. The result is facilitation of 
second-list learning. 


In Underwood’s experiment response similar- 
ity was varied over several steps. In the present 
study whatever degree of response similarity 
existed, it was the same for all Ss. (Note that 
we are considering response similarity, not inter- 
task similarity as it is defined in the present 
experiment.) It seems reasonable to assume that 
the responses of the two tasks used here were 
highly similar since even in the group where all 
stimuli and responses were newly paired on 
Task II, only the direction of movement was 
changed, not always by as much as 180°. Gagné,
Baker, and Foster (2) have argued that direction 
is only one of several dimensions of movement 
similarity. Thus the present experiment may be 
compared with the condition of high response 
similarity in Underwood’s study. For this con- 
dition (high response similarity) there was posi- 
tive transfer to the second task in both experi- 
ments and the transfer increased sharply with 
degree of first-task learning. It may also be 
pointed out that in the Underwood experiment, 
although in general positive transfer increased 
with first-task learning at each degree of response 
similarity, the increase was greater and more 
clear-cut with high response similarity. 

The other source of positive transfer is that 
which results from “learning-how-to-learn” (4). 
A practice effect in learning successive samples 
of the same or similar material has often been 
observed (for summaries, see 4 and 5). Learning- 
how-to-learn almost certainly occurred, since Ss 
were naive to the tasks. Furthermore, S’s task 
with left lever remained the same at all times, 
i.e., to hold the lever correctly. Since correct 
responses with the right lever could not be made 
unless the left lever was held correctly, we may 
have a specific case where learning-how-to-learn 
was required. Differences in positive transfer 
resulting from different degrees of Task I learn- 
ing, shown in Fig. 2, may be in part due to differ- 
ences in skill with the left lever, shown in Fig. 5. 

Since learning-how-to-learn was not con- 
trolled, we cannot know how much it contributed 
to positive transfer and how much was due to 
response generalization. However, learning- 
how-to-learn was probably not the only source 
of transfer, since in Underwood’s experiment 
with verbal lists (6), where it was better con- 
trolled, considerable transfer resulted from 
response generalization alone. 

It was noted above that our results are quite 
similar to those obtained by Lewis, McAllister, 
and Adams (3), where positive transfer was ob- 
tained when the second task was provided by 
reversing the direction of movement of the con- 
trols on the Mashburn apparatus. It is believed 
that the interpretation of positive transfer given 
above can, without change, also be applied to 
the findings of Lewis et al. 

Variation of what may be called “formal” 
similarity between the tasks in the present study 
was accomplished by varying the number of 
stimuli and responses newly paired on Task II.
The result was much the same as that found by 
Crafts (1) when he changed positions of various 
numbers of boxes in card sorting; as similarity 
decreased, positive transfer decreased. How- 
ever, in the present experiment there was no 
evidence of interference when the measure used 


was correct responses; as Fig. 3 shows, even the 
group practicing the least similar task (Group C) 
was well above the control on all trials. It is 
true that, as reported above, there was slight 
interference in Group C in errors, but it did not 
extend beyond the third trial. 

It seems important to emphasize the precise 
effect upon transfer obtained by varying the 
number of changed S-R connections; within 
limits of such variation obtainable here, transfer 
varied inversely with number of S-R connections
changed. This relationship between transfer 
and mere number of new S-R pairs held for all 
degrees of Task I learning; there was no evidence 
of interaction between learning and similarity. 
There is no a priori reason why this should be 
true. Without evidence to the contrary one 
could just as easily assume that in a complex 
task, where high performance depends upon inte- 
gration of several functionally interrelated com- 
ponents, changing even a few components would 
disrupt performance as severely as changing 
several. That this did not occur in the present 
experiment lends some justification to analyzing 
even complex tasks in terms of specific stimuli 
and responses. 


SUMMARY 


Transfer between two tasks, both 
available on the same apparatus, was 
studied as a function of two variables: 
degree of learning of the first task and 
similarity between tasks. The Ss 
were required to associate each of six 
colored-light stimuli with movements 
of a lever, grasped by the right hand, 
into six radially arranged slots, while 
simultaneously holding another lever 
steady with the left hand. There 
were four degrees of first-task learning 
and three degrees of inter-task simi- 
larity arranged as a factorial design in 
which 300 Ss were used, 25 in each of 
12 cells. The degrees of learning 
were: 10, 40, 80, and 180 trials. Simi- 
larity between tasks was defined in 
terms of the number of lights newly 
paired with different slots on the final 
task; in the three similarity conditions 
either 2, 4, or all 6 of the lights were 
paired with different slots on the final 
task. All Ss were given 60 trials on 
the final task. The results were: 
1. Transfer was positive for all 
experimental groups whether meas- 
ured in terms of correct responses or 
errors. Transfer increased directly 
both with first-task learning and inter- 
task similarity. 

2. There was no interaction between 
learning and similarity. 

3. Differential positive transfer re- 
sulting from varying first-task learning 
was obtained throughout acquisition 
of the final task. Differential positive 
transfer resulting from varying inter- 
task similarity occurred only during 
the first half of second-task practice. 

4. Loss in performance from end of 
the first to beginning of the final task 
varied directly with both variables, 
but in all experimental groups initial 
performance on the final task was 
superior to the control. 

5. There was almost no evidence of 
interference, measured by errors, be- 
tween tasks. 

6. Transfer was attributed to a 


combination of response generalization 
and learning-how-to-learn. 


(Received July 17, 1952) 
THE LAW OF PRIMARY REINFORCEMENT IN CHILDREN ¹ 


PAUL S. SIEGEL AND JAMES G. FOSHEE 
University of Alabama 


In protest of strict “rat psychology,” 
Hilgard has advanced a serious argu- 
ment for more naturalistic observa- 
tion and the utilization of the human 
subject in learning research (1). He 
feels that principles coming from the 
animal laboratory should be validated 
at a human level. Similar considera- 
tions have inspired several recent ex- 
periments. For example, Hubbard 
(2) has investigated secondary rein- 
forcement in the human subject and 
Saltzman (5) has studied the effect 
of delayed reward. The findings of 
both are in accord with the results of 
infrahuman studies. 

The present experiment was de- 
signed to examine the principle of 
primary reinforcement utilizing the 
human subject. Stated briefly, this 
principle maintains that the strength 
of a habit is positively related to the 
number of times it has been rewarded 
or accompanied by need or drive reduc- 
tion. It was elected because of its 
central role in reinforcement-type 
learning theories. The experimental 
findings of Perin (4) and of Williams 
(8) are usually adduced in its support 
(3). Employing the rat in a bar- 
pressing situation, they varied the 
number of reinforcements and meas- 
ured subsequent resistance to extinc- 
tion. The relationship was found to 
be a positive one. Our methodology 
parallels theirs with one significant 
variation. We employed the child as 
a subject. 


1 This research was supported in part by a 
grant from the Research Committee of the Uni- 
versity of Alabama. 
METHOD 


Apparatus.—The apparatus consisted of a 
Masonite box, painted green, of outside dimen- 
sions, height, 20 in., depth, 10 in., and width, 
64in. In front, at the base, was a wooden tray, 
34 X 3} in. Four inches above the tray, a }-in. 
steel rod projected 3 in. A pressure of 30 gm. 
resulted in an excursion of the rod through 
approximately 1} in. This served to close a 
mercury switch, energizing an electromagnetic 
release mechanism. A single pellet of candy was 
released from a magazine and fed by chute to 
the tray. An impulse counter and a remote 
switch were wired in series with the release 
mechanism. Following the fall of each pellet, 
E opened the circuit to prevent “jiggling” and 
thus ensure a discrete response for each rein- 
forcement. During the extinction series, this 
switch remained closed. The pellets were fash- 
ioned by splitting party mints and trimming to a 
cylindrical shape. Each weighed approximately 
.2 gm. The box was either placed upon the 
floor or on a nursery school table. The S was 
always free to sit or stand as he chose. The E 
was seated to one side at a distance of 3 to 6 ft. 
depending upon the size of the experimental 
room. 

Subjects.—The ages of Ss ranged from 2 yr., 
11 mo. to 5 yr., 8 mo. The Ss were divided into 
four groups, matched for age within a range of 
3 mo., and balanced for sex. There were ten 
males and ten females in each of four groups 
(total = 80). The four groups received 2, 4, 8, 
or 16 reinforcements. In addition to the 80 Ss 
constituting the experiment, 24 others were tried 
and discarded for various reasons. Nine failed 
to follow instructions. In 11 instances, the 
apparatus jammed. In 4 cases, E could not 
match for age. 

Procedure.—Experimental sessions were conducted 
from 9:30 a.m. to 10:30 a.m. Most of the 
children were reported as eating breakfast at 
approximately 7:30 a.m., making for a food- 
privation interval of 2 to 3 hr. The junior 
author served as E throughout the entire experiment, 
making for a uniform social environment. 
To establish rapport, E devoted a minimum of 
4-hr. social interaction with each nursery school 
or kindergarten group from which Ss were 
recruited. 

The experimental procedure was conducted 
as follows. The E entered the school yard and 
TABLE 1 

THE RELATIONSHIP BETWEEN RESISTANCE TO EXTINCTION AND NUMBER OF PRIOR REINFORCEMENTS 

                        Number of Responses during Extinction 
Number of Prior       ------------------------------------------------ 
Reinforcements        Mean  |  SD   | Median | M (log) | SD (log) 
       2              32.9  | 49.8  |  10.0  |  1.12   |   59 
       4              41.0  | 34.0  |  29.0  |  1.49   |   84 
       8              51.6  | 33.7  |  49.5  |  1.64   |   82 
      16              94.9  | 48.9  |  99.5  |  1.91   |   89 




















inquired of a child if he cared to see a new game. 
If he responded negatively, no effort was made to 
coax him. If he indicated acceptance, he was 
led to a nearby experimental room and told that 
he could sit or stand before the apparatus. The 
E then pointed to a single pellet in the tray and 
asked, “What is it?” If S indicated recognition, 
he was told that he could eat it. If no recognition 
was apparent, E told the child that it was 
candy and that he was free to eat it. Next, E 
pointed to the lever and inquired, “What do you 
think would happen if you pushed this?” If S 
pressed the lever, ate the candy, and pressed 
again, E said nothing further. If S pressed the 
lever and then failed to eat the candy, E said, 
“You may have as much as you like.” No 
further comment was offered. The S was given 
a predetermined number of reinforcements, the 
magazine having been loaded with 2, 4, 8, or 16 
candy pellets. The extinction series was ushered 
in by the first failure to secure a pellet. The 
number of unreinforced responses made by the 
child in the 3 min. immediately following the 
last reinforced response was recorded. This is 
our measure of resistance to extinction.² 

During extinction, E made no response to 
questions addressed him by the child. He 
simply maintained a “poker face.” 


RESULTS AND DISCUSSION 


Table 1 summarizes our findings. 
The means, standard deviations, and 
medians for the four distributions of 
raw scores are found there. The rela- 
tionship between the mean and the 
median suggested positive skewness, 


2 A simple enumeration of the number of un- 
reinforced responses occurring during a brief 
interval of time has been reported by Walker 
(7) as a reliable index of resistance to extinction 
in the bar pressing of the rat. 


particularly in the instance of the two 
groups receiving few reinforcements 
(also true of the Perin-Williams rat 
data). Snedecor’s g1 test for skewness 
(6) was applied and significance 
at the .01 level was indicated. A 
logarithmic transformation of the data 
was made and found to be essentially 
normal. The means and SD’s of the 
log values are also presented in Table 1. 

The data of Table 1 strongly sug- 
gest a positive relationship between 
resistance to extinction and number 
of prior reinforcements. As the num- 
ber of reinforcements increases, the 
average number of extinction responses 
(both means and medians) increases. 
The statistical significance of this 
trend may be evaluated in several 
ways. Basically, we must assess the 
significance of the differences obtained 
among groups. We have applied two 
tests, one to the nonnormal raw score 
distributions and one to the normal or 
logarithmic transformations. The 
first, a chi-square analysis, is found in 
Table 2. The median of the total 
distribution (combining all groups, 
n = 80) was determined and a simple 
count of subjects yielding scores above 
and below this value was noted for 
each group. If resistance to extinc- 
tion is independent of number of rein- 
forcements, then with only chance ex- 
ception, ten cases should fall above 


TABLE 2 

CHI-SQUARE ANALYSIS FOR DIFFERENCES AMONG GROUPS 

                     Above Median      Below Median 
Reinforcements      of Total Group    of Total Group      N 
      2                 5(10)            15(10)          20 
      4                 7(10)            13(10)          20 
      8                12(10)             8(10)          20 
     16                16(10)             4(10)          20 
                         40                40            80 

χ² = 14.8 
p < .01 
this median and ten below within each 
group. The chi-square value of 14.8, 
corresponding to a probability value 
of less than .01, indicates that the 
hypothesis of independence is scarcely 
tenable. We conclude that significant 
differences obtain among our four 
groups or that resistance to extinction 
is genuinely dependent upon number 
of reinforcements. The second test, 
applied to the transformed distribu- 
tions, consisted of calculating the 
regression coefficient for extinction 
upon reinforcement. This was found 
to be .0504 with a standard deviation 
of .00845. The t of 5.96 indicates a 
probability value of less than .01. 
Again, we conclude that the relation- 
ship is statistically significant. 
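
The arithmetic of the two tests can be retraced with a short sketch. The Python below is an illustration only and is not the authors' own computation. It recomputes chi-square from the counts of Table 2, taking ten cases per cell as the expectation under independence, and forms the t ratio from the reported regression coefficient and its standard deviation. The variable names, and the tabled critical value of 11.345 for 3 df at the .01 level, are supplied here as assumptions. 

```python
# Observed counts from Table 2: (above, below) the median of the total group.
observed = {2: (5, 15), 4: (7, 13), 8: (12, 8), 16: (16, 4)}
expected = 10.0  # under independence, 10 of each group's 20 Ss fall on each side

# Sum (O - E)^2 / E over all eight cells of the 4 x 2 table.
chi_square = sum(
    (count - expected) ** 2 / expected
    for above_below in observed.values()
    for count in above_below
)
df = (len(observed) - 1) * (2 - 1)  # (rows - 1) * (columns - 1)

# Tabled critical value of chi-square for 3 df at the .01 level (assumed here).
CRITICAL_01_DF3 = 11.345

# t ratio for the regression of log extinction score on number of reinforcements.
slope, slope_sd = 0.0504, 0.00845
t_ratio = slope / slope_sd

print(round(chi_square, 1), df, chi_square > CRITICAL_01_DF3)
print(round(t_ratio, 2))
```

Under these assumptions the sketch returns the chi-square of 14.8 and the t of 5.96 reported in the text. 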

In the Perin-Williams study, resist- 
ance to extinction was defined as the 
number of unreinforced responses 
yielded by the rat to a criterion of no 
responses within a 5-min. interval. 
Perin (4) has imposed an exponential 
function upon the median values with 
reasonably good agreement. This 
function could be fitted to our data. 
Unfortunately, with but four empirical 
points, many other general functions 
could be equally well fitted. Nothing 
seems to be gained by this procedure. 
We are satisfied to state that our 
results are in good general agreement 
with the Perin-Williams findings. It 
may be fairly said that with the child, 
we have confirmed the positive rela- 
tionship between resistance to extinc- 
tion and number of prior reinforce- 
ments. And, viewing our measure of 
extinction resistance as an index of 
habit strength, our results constitute 


a validation of the principle of pri- 
mary reinforcement at the human 
level. 

SUMMARY 


Employing the child as subject in a 
bar-pressing situation, the principle 
of primary reinforcement was ex- 
amined. Number of reinforcements 
(candy) was varied for each of four 
groups. The number of unreinforced 
responses occurring within a 3-min. 
interval immediately following the 
last reinforced response was recorded. 
In good agreement with infrahuman 
studies, it was found that resistance 
to extinction or habit strength is posi- 
tively related to number of prior 
reinforcements. 


(Received July 7, 1952) 
ASSIMILATION OF INFORMATION FROM DOT AND 
MATRIX PATTERNS 


E. T. KLEMMER AND F. C. FRICK 


Human Resources Research Laboratories 


Any situation involving perception 
may be considered in terms of the 
transmission of information. What is 
perceived may be described, not only 
by words and numbers related to the 
physical nature of the stimulus, but 
also in terms of an “amount of information” 
measure. Although this measure 
is usually defined in other terms, 
Fano (2) has shown that the informa- 
tion associated with any event is 
proportional to the negative logarithm 
of its probability of occurrence. Un- 
common stimuli convey much infor- 
mation, whereas expected stimuli con- 
vey little. Since the information 
measure is a function of the probabil- 
ities of the stimuli and responses in the 
situation under study, it has units 
which are not specific to the particular 
stimulating conditions or particular 
responses. The measure may be ap- 
plied without logical difficulty to any 
situation in which one is willing to 
identify the members of the stimulus 
and response classes and make some 
statements about their probability 
distributions. Whether or not the 
measure is useful in the analysis of 
human behavior remains to be proven. 
Early results from its application are, 
however, encouraging (3, 4, 5, 6, 7). 

The present study attempts to do 
two things: first, demonstrate the 
application of the information measure 
to a simple visual perception, and, 
second, discover the maximum amount 
of information that can be assimilated 
using a particular visual code. 

Suppose we ask: How much infor- 
mation do we take in, or how much do 
we learn, about spatial location in a 
single glance? The way we have gone 
about answering the question is to 
give a group of Ss a very quick look 
at one or more dots on a plane and 
ask them to reproduce the location of 
the dots. We then applied the infor- 
mation measure to the presented 
stimulus and S’s responses and came 
out with at least a partial answer to 
our question. 


METHOD 


Apparatus.—The visual stimuli were photo- 
graphed on 2 X 2-in. slides and projected on a 
screen from a 300-w. slide projector fitted with a 
.03-sec. shutter. The slides consisted of one or 
more white dots inside the white outline of a 
square on a black background. For some of the 
tests the square contained internal grid lines 
which divided the square into a square matrix. 
The dot(s) were always located in the center of 
the cells of the matrix. When the internal grid 
lines of the matrix were not included on the slide, 
the dots were positioned in the centers of imagi- 
nary cells of a matrix whose order was equal to 
the order of the matrix on S’s answer sheet. 
The S’s responses in each case were made on 
answer sheets printed with appropriate matrix 
patterns. 

Subjects.—Undergraduate students at the 
University of Maryland served as paid Ss. The 
same Ss were used for the first four tests. A 
different group was used for the last eight tests. 

Procedure.—Fifteen to 25 Ss were run simul- 
taneously in a semidarkened room. The Ss faced 
a 40 X 40-in. projection screen which was slightly 
larger than the projected outline of the square. 
A “test” is defined as a series of slides of the 
same matrix order and a defined number of dots. 
Before each test, Ss were informed as to the 
number of dots which might appear and whether 
or not the internal grid lines would appear on the 
screen. They were told that the dot(s) could 
occur in any position and that all positions were 
equally likely. During the test each possible 
position of the dot(s) was presented once. The 
order of presentation was random. The Ss were 
not told, and showed no sign of learning, that no 
stimulus occurred twice in the same test. They 
TABLE 1 

CONDITIONS OF TESTS AND NUMBER OF Ss EMPLOYED 

                   Number                   Lines on 
Test   Matrix      of Dots   Alternatives   Slides      Ss 
  1    3 X 3          1            9         Yes        26 
  2    4 X 4          1           16         Yes        28 
  3    6 X 6          1           36         Yes        28 
  4    8 X 8          1           64         Yes        22 
  5    10 X 10        1          100         Yes        42 
  6    10 X 10        1          100         No         79 
  7    10 X 10        1          100         No*        80 
  8    20 X 20        1          400         No         80 
  9    3 X 3          3           84         No         87 
 10    3 X 3          4          126         Yes        80 
 11    3 X 3          4          126         No         80 
 12    3 X 3         1-4         255         No         79 

* Also no grid lines on answer sheets. 


were instructed to mark the position of the dot(s) 
on the answer-sheet grids in the same relative 
position as the dots on the projected square. 
The exposure time was held constant at .03 sec. 
throughout the experiment. At least two prac- 
tice slides were given before each test and a 
verbal ready signal was given before each flash. 

The basic experimental design was as follows: 
In the first experiment the information content 
of the single-dot slides was increased by increas- 
ing the order of the matrix used to position the 
dot. Thus, for one dot on the 3 X 3 matrix 
there were nine possible positions. Each slide 
constituted a single stimulus from, in this case, 
a set of nine alternative stimuli. For the 4 X 4 
matrix there were 16 possible stimuli, and so on. 

In the second experiment the matrix order 
was held constant at 3 X 3 and the number of 
dots varied from one through four. This was 
done both by combining all of the one- through 
four-dot slides into a single test with random 
ordering of number as well as position of the 
dots and also by showing the four-dot slides and 
the three-dot slides as separate tests. (The one- 
and two-dot slides on a 3 X 3 matrix are received 
perfectly whether shown alone or in combination 
with the three- and four-dot slides.) 

Two pairs of tests were run in which the mem- 
bers of each pair differed only in that one had 
the internal grid lines included on the projected 
slide and one did not. For one test there were 
no internal grid lines on either the slides or the 
answer sheets. The dot was positioned on the 
slide in the center of a cell of an imaginary 
10 X 10 matrix. The Ss made small marks on 
answer-sheet squares trying to locate the dot as 
nearly as possible in the same relative position 
as the dot on the slide. The answer sheets were 


scored by superimposing a 10 X 10 matrix over 
the squares and tabulating the cells into which 
Ss’ marks fell. 

Table 1 presents the several orders of mat- 
rices, the number of dots, the grid conditions, 
and the number of Ss for each test. 

Scoring.—All of the tests were scored in terms 
of the “Amount of information” transmitted. 
The information content of each stimulus is 
simply the negative logarithm of its probability 
of occurrence. If logarithms to the base two 
are used, we define a unit which has been termed 
the bit (2, 8). 
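
As an illustration of this definition, the following Python sketch computes the information content of a single-dot slide for each matrix order used. It assumes, as Ss were told, that all positions are equally likely, so that the probability of any one slide is the reciprocal of the number of cells. The function name is hypothetical and was chosen only for this example. 

```python
import math


def information_bits(n_alternatives: int) -> float:
    """Information (bits) of one stimulus among n equally likely alternatives."""
    probability = 1.0 / n_alternatives
    return -math.log2(probability)


# Single-dot tests: a matrix of order n offers n * n equally likely positions.
for order in (3, 4, 6, 8, 10, 20):
    bits = information_bits(order * order)
    print(f"{order} X {order}: {bits:.1f} bits")
```

The values printed can be compared with the information presented for the single-dot tests. 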

In order to compute the information success- 
fully transmitted, distributions were obtained of 
the responses of all Ss for each stimulus sep- 
arately. The uncertainty associated with these 
distributions (sometimes called response equivo- 
cation) was then estimated from the relative 
frequencies of the observed responses. The 
average response equivocation was subtracted 
from the information content of the stimulus to 
get the transmitted information. This method 
assumes that the total informational content of 
the response is equal to that of the stimulus. 
This assumption was checked in several of the 
tests and found to be true within the limits of 
accuracy of these experiments. For a fuller dis- 
cussion of the methods for computing trans- 
mitted information the reader should see Garner 
and Hake (4). 
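
The scoring procedure just described may be sketched as follows. This Python is a minimal illustration under stated assumptions, not the computation actually carried out: stimuli are taken to be equally likely, the function names are hypothetical, and the small data set is invented solely to show the arithmetic. 

```python
import math
from collections import Counter


def entropy_bits(counts):
    """Uncertainty (bits) of a distribution estimated from relative frequencies."""
    counts = list(counts)
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def transmitted_bits(responses_by_stimulus):
    """Stimulus information minus the average response equivocation.

    responses_by_stimulus maps each stimulus to the responses made to it by
    all Ss; the stimuli are assumed to be equally likely.
    """
    stimulus_information = math.log2(len(responses_by_stimulus))
    equivocations = [
        entropy_bits(Counter(responses).values())
        for responses in responses_by_stimulus.values()
    ]
    return stimulus_information - sum(equivocations) / len(equivocations)


# Invented data: four stimuli, four Ss each (not taken from the experiment).
toy = {
    "a": ["a", "a", "a", "a"],  # always reproduced correctly
    "b": ["b", "b", "b", "b"],
    "c": ["c", "c", "d", "d"],  # confused with "d" half the time
    "d": ["c", "c", "d", "d"],
}
print(transmitted_bits(toy))
```

In this invented case 2 bits are presented, the average equivocation is 0.5 bit, and 1.5 bits are transmitted. 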


RESULTS 


Before considering the effect of 
changing the number of cells or num- 
ber of dots (increasing the informa- 
tional input) let us look at the effect 
upon the information transmitted of 
including or omitting the internal 
grid lines. The results from comparable tests, with and without such 
grid lines, are presented in Table 2. 


TABLE 2 

EFFECT OF INTERNAL GRID LINES UPON THE TRANSMISSION OF INFORMATION ABOUT THE POSITION OF DOTS IN A SQUARE 

                                 Grid Lines 
                   Number     ----------------     Information 
Test   Matrix      of Dots    Slides    Sheets     Transmitted* 
  5    10 X 10        1        Yes       Yes           4.4 
  6    10 X 10        1        No        Yes           4.4 
  7    10 X 10        1        No        No            4.4 
 10    3 X 3          4        Yes       Yes           6.6 
 11    3 X 3          4        No        Yes           6.6 

* Information transmitted in bits per stimulus. 

From these results, which are repre- 
sentative of the entire range of tests in 
this study, it was concluded that inclu- 
sion or omission of grid lines on the 
slides had no effect on the amount of 
information transmitted. Thus, tests 
are compared along other dimensions 
without reference to grid lines on the 
slide. 

The effect upon information trans- 
mitted of varying the number of cells 
in the matrix is shown in Table 3 for 
the single-dot tests. Increasing the 
number of cells in the matrix increases 
the number of alternative stimuli and 
therefore increases the amount of in- 
formation presented. 

It appears from Table 3 that the 
information transmitted increases with 
the information presented, but only 
to about 4.4 bits, where it becomes 
constant. This suggests that, in a 
single glance, we take in approxi- 
mately 4.4 bits of information about 
the location of a point in a plane. 

The results of the multiple-dot tests 
with the 3 X 3 matrix are given in 
Table 4. Only the three- and four- 
dot tests were run as separate tests, 


TABLE 3 

TRANSMISSION OF INFORMATION ABOUT THE POSITION OF A SINGLE DOT IN A MATRIX VISUALLY EXPOSED FOR .03 SEC. 

                            Bits per Stimulus 
                        ------------------------- 
Test       Matrix       Presented     Transmitted 
  1        3 X 3           3.2            3.2 
  2        4 X 4           4.0            3.9 
  3        6 X 6           5.2            4.4 
  4        8 X 8           6.0            4.5 
5,6,7      10 X 10         6.6            4.4 
  8        20 X 20         8.6            5.0* 

* Value is inflated by small sample bias. Number of Ss was small compared to number of alternatives. 


TABLE 4 

TRANSMISSION OF INFORMATION ABOUT POSITION OF ONE THROUGH FOUR DOTS IN A 3 X 3 MATRIX VISUALLY EXPOSED FOR .03 SEC. 

                                           Bits per Stimulus 
          Number                    ------------------------------- 
Test      of Dots    Alternatives   Presented      Transmitted 
  1          1             9           3.2         3.2      3.2* 
  -          2            36           5.2                  5.1* 
  9          3            84           6.4         6.1      6.2* 
10,11        4           126           7.0         6.6      6.7* 
  -         1-2           45           5.5                  5.4* 
  -         1-3          129           7.0                  6.9* 
 12         1-4          255           8.0         7.8 

* Computed from Test 12. 


since the one- and two-dot messages 
are received perfectly, or almost per- 
fectly, by all Ss. Note that the num- 
ber of alternative stimuli and, there- 
fore, the average information presented 
goes up rapidly as the number of dots 
is increased. Combining several num- 
bers of dots into a single test added 
alternative stimuli and further in- 
creased the information presented. 

Only the values in the first infor- 
mation-transmitted column were ob- 
tained from separate tests. However, 
as the results of the three- and four-dot 
tests clearly indicate, there was no loss 
in accuracy of response when these 
stimuli are included in the one-to- 
four-dot test. This is an interesting 
finding. Adding the three-, two-, and 
one-dot stimuli to the four-dot test did 
not affect the responses to the four-dot 
messages but it did double the number 
of alternatives and thus add one bit to 
the information-carrying capacity of 
the four-dot messages. 
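
The count of alternatives behind this statement can be retraced with a brief sketch. The Python below assumes only that each dot occupies a different cell of the 3 X 3 matrix, so that the number of k-dot slides is the number of ways of choosing k cells from nine. The variable names are illustrative and do not come from the original computation. 

```python
import math

CELLS = 9  # a 3 X 3 matrix

# Number of distinct slides with exactly k dots, each dot in a different cell.
alternatives = {k: math.comb(CELLS, k) for k in range(1, 5)}

four_dot_only = alternatives[4]
one_to_four = sum(alternatives.values())

# Information presented (bits) when all slides in a test are equally likely.
print(alternatives)
print(four_dot_only, round(math.log2(four_dot_only), 1))
print(one_to_four, round(math.log2(one_to_four), 1))
```

On this count the four-dot test has 126 alternatives (about 7.0 bits) and the one-to-four-dot test has 255 (about 8.0 bits), which is the doubling, and the added bit, referred to above. 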

It is possible to consider the one-, 
two-, and three-dot messages as special 
cases of the four-dot series. In gen- 
eral, n-dot messages may be assumed 
to include all messages from one to n. 
Following this reasoning the one-to- 
four-dots test is scored in parts as a 
two-dot test which includes the one-dot 
slides and a three-dot test which 
includes the one- and two-dot slides. 
The result of this scoring method is 
shown in Table 4. Since the response 
to any of these slides does not seem to 
be a function of the company it keeps, 
it may be concluded that, in this case, 
reducing the number of alternatives 
by selecting only some of them for use 
is an inefficient means of encoding. In 
other words, using only the four-dot 
slides is inefficient when adding the 
one-, two-, and three-dot slides 
increases the information-carrying 
capacity of the system without in- 
creasing the error. 

Note that there was very little loss 
of information in any of the multiple- 
dot tests. It is probable, therefore, 
that the value for information trans- 
mitted for each number of dots is 
below S’s maximum capacity. Never- 
theless, it is clear that increasing the 
number of dots from one through four 
does increase the information-carrying 
capacity of the message from the 4.4 
bit maximum for one dot to at least 
6.6 bits for the four-dot case or 7.8 
bits for one-to-four-dots. This is the 
second finding of this experiment. 
Increasing the number of dots from 
one to four increases the maximum 
amount of information transmitted. 


Discussion 


In order to get an idea of what the 
4.4- to 7.8-bit range found in this 
experiment means, let us compare it to 
the information assimilated from other 
types of messages. An S can repro- 
duce without error three to four ran- 
domly chosen letters when they are 
seen for .01 sec. (1). This represents 
an information transfer of 14.1 to 18.8 


bits. Under the same conditions S 
can read about two unconnected 
words (1). On the assumption that 


the words are drawn at random from a 
vocabulary of 10,000 this represents 
26.6 bits. It is possible that pictures 
would show an even higher value of 
information assimilated but no satis- 
factory measurements have yet been 
made. 
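
The figures for letters and words can be reproduced under simple assumptions. The sketch below, in Python, assumes a 26-letter alphabet (the text does not state the alphabet size) and the vocabulary of 10,000 words named above, with every item drawn independently and with equal probability. The function name is hypothetical. 

```python
import math


def bits_for_items(n_items: int, set_size: int) -> float:
    """Bits conveyed by n items, each drawn at random from a set of fixed size."""
    return n_items * math.log2(set_size)


ALPHABET = 26        # assumed; not stated in the text
VOCABULARY = 10_000  # as given in the text

letters_low = bits_for_items(3, ALPHABET)
letters_high = bits_for_items(4, ALPHABET)
words = bits_for_items(2, VOCABULARY)

print(f"{letters_low:.1f} {letters_high:.1f} {words:.1f}")
```

With these assumptions the sketch gives 14.1 and 18.8 bits for three and four letters and 26.6 bits for two words. 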

At the other extreme, if S is asked 
to locate a single point on a straight 
line, the information successfully trans- 
mitted is in the neighborhood of 3 bits 
(5). 

It is clear that the maximum amount 
of information that can be assimilated 
from a brief visual exposure is a func- 
tion of the type of encoding used. 
The question immediately arises as to 
whether or not there is a common 
metric which may be applied to the 
different message classes and which 
will correlate with the maximum infor- 
mation-carrying capacity of that class. 

Of course, we are not in a position 
on the basis of these scanty data to 
draw general conclusions applicable to 
other forms of encoding. However, 
there is a suggestion here. It takes 
two coordinates to specify the position 
of a single dot in a plane. It takes 
four coordinates to specify two dots, 
six for three dots, etc. These coordi- 
nates may be considered as dimensions 
of a message space in which each stim- 
ulus is a single point. It requires only 
one coordinate, or dimension, to spe- 
cify the position of a point on a line. 
This latter type of stimulus has a 
maximum information-carrying ca- 
pacity of about 3 bits. We have seen 
that the two coordinates of the single 
dot on a plane can transmit about 4.4 
bits and the eight coordinates of one- 
to-four-dots at least 7.8 bits. Thus it 
appears that increasing the dimensions 
of the message space increases the 
information-carrying capacity of the 
stimulus, in this range at least. We 
hasten to add that there is no certainty 
that the message dimensions are interchangeable, 
i.e., that only the number 
of dimensions is important. 


SUMMARY 


An experiment was designed to dem- 
onstrate the application of information 
measure to a simple visual perception 
and to find the maximum information 
about a two-dimensional spatial loca- 
tion which can be assimilated from a 
brief visual exposure. The stimuli 
consisted of one or more dots within 
the outline of a square. This pattern 
was flashed on a projection screen for 
.03 sec. The Ss reported the positions 
of the dots by making marks in corre- 
sponding positions on answer sheets 
printed with the outline of a square 
and containing internal grid lines. 
The dots were positioned on the pro- 
jected slide in positions corresponding 
to centers of cells in the S’s answer 
matrix. Using a single dot, the order 
of the matrix on the S’s answer sheet 
was increased from 3 X 3 to 20 X 20 
in separate tests without changing the 
outside dimensions of the square. 
Using a 3 X 3 matrix the number of 
dots on each slide was increased from 
one through four in separate tests. 
The data were scored in terms of the 
amount of information successfully 
transmitted by S from the screen to 
his answer sheet. 

1. The Ss transmitted a maximum 
of 4.4 bits of information about the 
position of a single dot in a square 
from a .03-sec. exposure. 

2. The information transmitted in- 
creased with the number of dots used 
to at least 6.6 bits per exposure for 
four dots. 


3. Combining the one-, two-, three-, 
and four-dot slides into a single series 
did not result in less accuracy than 
when each number of dots was shown 
separately. Because of the increased 
number of alternatives, however, the 
information transmitted increased to 
7.8 bits per exposure. 

4. Including internal grid lines on 
the projected square had no effect on 
the information transmitted. 

5. Removing the internal grid lines 
from S’s answer sheet had no effect 
upon the information transmitted by 
a single dot. 


(Received June 23, 1952) 
DRIVE CONDITIONING AS A FACTOR IN LATENT LEARNING ¹ 


PAUL E. MEEHL AND KENNETH MacCORQUODALE 


University of Minnesota 


In 1929 Blodgett (1) reported a now- 
classic experiment in which rats which 
had been running a maze without food 
reward exhibited a sudden drop in 
errors on the day following a single 
rewarded run. We do not propose 
here to discuss the experimental reality 
of this “Blodgett effect,” which has 
not as yet been shown to be con- 
sistently reproducible (2, 9, 11, 16), 
nor the theoretical and statistical con- 
troversies involved in its interpreta- 
tion, which have been treated at 
length by Kendler (6), Maltzman (8) 
and Thistlethwaite (12, 13) in recent 
papers. Assuming for present pur- 
poses that the effect does occur, we 
wish to examine its theoretical impli- 
cations from another angle. Critics 
of S-—R-reinforcement theory from 
Tolman on (14, p. 343; 15, p. 194) 
have taken it for granted that the 
feeding in the goal box is what brings 
about the subsequent sudden appear- 
ance of an error decrement. But 
actually the experiment does not 
establish this, since no control feeding 
outside the goal box or temporally 
separated from a maze run was em- 
ployed. From the human observer’s 
viewpoint (especially if he has cogni- 
tive-map leanings) it is quite natural 
to assume that if a feeding in the goal 
and following a run yields the Blodgett 
effect, this particular spatio-temporal 
locus is what does the business. But 
such an assumption clearly requires 
scrutiny. 

This is the more so because there is 
an intriguing S—R-reinforcement possi- 

1 This study was made possible through a 


research grant by the Graduate School of the 
University of Minnesota. 


bility which presents itself as soon as 
the above issue is once raised. It is 
admitted on all sides that some decline 
in errors is shown by the nonrewarded 
rats during the “latent phase.” We 
need not consider here what rewards 
may be operative to produce this 
already manifest learning (but cf. 7). 
The sticker for a reinforcement the- 
orist, at least as we perceive it, hinges 
on the quantitative question, “Why 
such a drop from one reinforcement at 
this level of the learning curve?” 
Now one of the ways to get sudden 
shifts in performance without equally 
sudden changes in habit (or cognitive!) 
variables, according to S-R-reinforcement 
theory, is to alter motivation. 
Discussions of the Blodgett effect tend 
to assume that the place to look for 
the explanation is in the operation of 
reinforcement and resulting habit 
strength. But if in this design there 
is inherent some mechanism that pro- 
vides for a shift in the rat’s degree of 
hunger, might this not be a better 
place to look? If sER is a multipli- 
cative function of sHR and f(D), in- 
creasing either factor will increase sER. 
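
The multiplicative relation invoked here can be written out as a short worked inequality. The product form is the one stated in the text; the primed symbols, and the assumption that habit strength is positive and unchanged from one day to the next, are ours and serve only to illustrate the argument.

```latex
\[
  {}_{S}E_{R} \;=\; {}_{S}H_{R} \times f(D)
\]
\[
  f(D') > f(D), \quad {}_{S}H_{R} > 0 \ \text{held fixed}
  \;\Longrightarrow\;
  {}_{S}E'_{R} \;=\; {}_{S}H_{R} \times f(D') \;>\; {}_{S}H_{R} \times f(D) \;=\; {}_{S}E_{R}
\]
```

On this reading a rise in the drive term alone, with no new habit acquisition, is enough to raise reaction potential.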

Human introspection suggests that 
the “subjective state” of hunger, for a 
given interval of deprivation, can be 
manipulated by exteroceptive cues. 
Presumably Hull’s Corollary i of 
Postulate III in the 1949 set, “Second- 
ary Motivation” (4, p. 175), is in- 
tended to cover this kind of environ- 
mental control over drive level itself. 
There is independent evidence (3, p. 
33) that some of the presumed “com- 
ponents of being hungry,” such as 
stomach contractions, salivation, 
damped ingestive responses, and even 
blood-sugar level can be brought under 
some degree of exteroceptive control. 
Thinking along these lines, we were 
struck by the following hypothesis: 
During the latent phase, differential 
habit strengths are accumulating in 
favor of the “correct” choices (as the 
data show). At the end of the first 
rewarded run, the rat feeds in the goal 
box and by the above corollary ac- 
quires a conditioned hunger drive (or, 
more precisely, his hunger drive, 
which during the latent runs has been 
based upon mere deprivation, is now 
strengthened by the new exteroceptive 
conditioning). On the following day’s 
run, this conditioning generalizes to 
the similar (and partially overlapping) 
stimulus complex of being “in the 
maze,” so that the rat, although under 
the same deprivation as on previous 
days, is now “hungrier” than he was 
on these days. Hence, we have an 
augmented drive factor which, accord- 
ing to Hullian theory, will multiply by 
the (also augmented!) habit strengths 
already favoring the correct path and 
hence produce the Blodgett drop. 

Admittedly this is a touchy hypoth- 
esis to test, and we must emphasize 
that the design which follows is not 
presented as an experimentum crucis 
to clinch any such argument. As 
usual in such “derivations,” the 
hypothesis does not actually quantify 
the parameters but it does require that 
certain rather complex relations ob- 
tain among their magnitudes. We 
have tried to test just one implication 
of such a hypothesis, and this test can 
only be added to the other numerous 
lines of evidence and theory needed 
to explicate Blodgett’s experiment. 
However, the prediction confirmed is 
somewhat paradoxical and its rather 
bizarre character perhaps makes its 
success more probative than would 
ordinarily be the case. 

For if the above hypothesis were 
correct, one should be able to produce 
the drop in errors by a feeding not in 
the goal box and not at the conclusion 
of a run. In fact, a feeding experience 
administered in a sufficiently “maze- 
like situation” should bring about the 
desired drive conditioning. (We are, 
of course, not talking here of “pre- 
feeding,” i.e., just before a day’s run, 
as a facilitator.) Accordingly, we 
determined to substitute a feeding 
experience in a maze-like but extra- 
maze box for the single rewarded run 
introduced by Blodgett, meanwhile 
avoiding the possibility of a direc- 
tional or locational learning by actively 
opposing any such factor through the 
arrangement of apparatus. 


METHOD 


Subjects.—Thirty male albino rats, approxi- 
mately 90 days old and experimentally naive, 
were used. These were subsequently subdivided 
into two matched groups of 15 rats each, as 
described below. 

Apparatus.—We used a replica of the six-unit 
multiple-T Blodgett maze; unlike his, ours had a 
permanent floor. The maze was placed on the 
floor, diagonally across the experimental room, 
which was square, windowless, otherwise unfur- 
nished, and quiet. The illumination came from 
a 7½-w. overhead light, placed symmetrically 
with respect to the maze and providing a uniform 
brightness of less than 3 ft.-candles at maze level. 

The extra-maze box was a 2-ft. long section 
of alley identical in composition, color (black), 
and in height and width dimensions with the 
alleys of the maze. This was placed on a stand 
in the experimental room, so that its floor was 
4 ft. above the level of the maze floor, about one 
yard in front of the starting box of the maze in 
the opposite direction from the goal box. The 
long axis of this box was at right angles to the 
major (start-to-goal) axis of the maze. This is 
the box in which half the rats were to be fed 
after a series of nonrewarded maze runs. 

Procedure.—Test breaking: Since the maze 
trials in this experiment were to be run under 
conditions of food deprivation but without pri- 
mary food reinforcement in the maze, we took 
special precautions to assure that emotionality in 
the experimental situation would be minimized, 
and that a feeding rhythm would be established, 
during the test-breaking phase. This lasted 
seven days. Throughout this period each rat 
was handled and carried about for 14 min. daily. 
Each rat was given about 10 gm. of wet mash 
daily at the hour at which he would be fed during 
the experimental sessions proper. Feeding was 
done in special feeding cages of different con- 
struction and dimensions from the home cages. 
On the fourth, fifth, and sixth test-breaking days, 
the rats were placed, in squads of five, in a tunnel- 
type straightaway for 20 min. The long axis of 
the straightaway was at right angles to the long 
(start-to-goal) axis of the maze. The point of 
entry into the straightaway was randomly 
varied; the point of removal was always from the 
middle two-thirds of the alley. The straight- 
away had no goal boxes and never contained 
food. From the straightaway the animals were 
removed to individual “detention” cages, differ- 
ing markedly from both home and feeding cages, 
for 20 min., thence to the feeding cages where 
food was withheld for 40 more minutes, i.e., for 
one hour after removal from the straightaway. 
The purpose of the detention cages was to reduce, 
or at least temporally delay, the secondary rein- 
forcing effects of the feeding cages. On the 
seventh test-breaking night the straightaway 
was not used, but instead the rats were placed 
singly in the extra-maze box (as described above) 
for 3 min., then removed to the detention and 
feeding cages as before. 

Maze running: All rats were treated iden- 
tically during the first five days. One maze trial 
was given daily, under 22½-hr. food deprivation. 
Immediately preceding his day’s run, each rat 
was placed for 30 sec. in the extra-maze box, 
from which he was removed directly to the 
starting box of the maze. There was no food in 
the goal box of the maze at any time. Retracing 
was prevented by swinging doors. The rats 
were detained in the goal box 1 min. following 
the run. Then, as during the test-breaking 
period, they were removed to the detention 
cages for 20 min., thence to the feeding cages, 
where food was withheld for at least another 
40 min. Records were kept of running times 
and of calibrated errors; an error score of 1 was 
recorded if the rat placed his head and forepaws 
but not his hind feet into the cul; a 2 was recorded 
if his body, including the hind feet, entered the 
cul but his nose did not pass the point marked 
34 in. from the end of the cul; a 3 was recorded if 
his nose passed this mark. According to this 
system, only cul penetrations to the depth scored 
2 or 3 would have been counted by Blodgett as 
errors; henceforth such errors will be called 
Blodgett errors. After the fifth night’s runs, 
two pairwise matched groups of 15 rats each were 
formed. The matching was on the basis of 
several error criteria: total number of Blodgett 
errors over all five days; summed weighted errors 
over all five days; total Blodgett errors on Day 5; 
summed weighted errors on Day 5. The close- 
ness of the matching may be inferred from the 
failure of the curves in Fig. 1 and 2 to diverge 
markedly over the first five trials, and from their 
convergence on Trial 5. 
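
The calibrated scoring scheme can be sketched in a few lines. The sketch assumes that "weighted errors" means the sum of the calibrated scores (1, 2, or 3) over a run, which the text implies but does not state outright; the function names and the sample run are hypothetical.

```python
VALID_SCORES = (1, 2, 3)


def _check(cul_scores):
    """Reject anything that is not a calibrated score of 1, 2, or 3."""
    for score in cul_scores:
        if score not in VALID_SCORES:
            raise ValueError(f"not a calibrated error score: {score!r}")


def blodgett_errors(cul_scores):
    """Count cul entries scored 2 or 3, the depth Blodgett would have counted."""
    _check(cul_scores)
    return sum(1 for score in cul_scores if score >= 2)


def weighted_errors(cul_scores):
    """Sum the calibrated scores over one run (assumed meaning of 'weighted')."""
    _check(cul_scores)
    return sum(cul_scores)


# Hypothetical run with four cul entries of varying depth.
run = [1, 3, 2, 1]
print(blodgett_errors(run))  # 2
print(weighted_errors(run))  # 7
```

Under this scheme a shallow entry (score 1) adds to the weighted total but not to the Blodgett count, which is why the weighted measure can separate animals that the Blodgett count treats as equal.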

On the sixth night, neither group was run in 
the maze. The rats were handled in the usual 
order, but those which were assigned to the 
“extra-maze fed” group were placed in the extra- 
maze box where, for the first time, they found a 
cup of wet mash, which they were permitted to 
eat for 2 min. The food cup was located about 
two-thirds of the alley length from the end at 
which the rats were inserted. The “not-fed” 
animals were also placed in the extra-maze box 
for 2 min., with no food present. All rats were 
removed from the extra-maze box to the deten- 
tion and feeding cages as before. Each extra- 
maze-fed rat was ultimately given the same food 
cup he had eaten from in the extra-maze box, to 
ensure that this group did not eat more than the 
nonfed group. If, then, the drive-conditioning 
mechanism does occur as a result of eating in a 
similar situation, its effects should be observable 
in the greater error reduction of the extra-maze 
fed group, as compared with the nonfed group, 
on the subsequent run. 

The sixth, or test, trial was run the following 
night. As before, rats were placed for 30 sec. in 
the extra-maze box (no food or food cup being 
present), and then performed one unreinforced 
maze run. Errors and time were recorded as 
before. 


RESULTS 


[Fig. 1. Mean Blodgett errors, matched groups.] 

Since there was a definite anteced- 
ent expectation that the box-fed rats 
would do better on the test run than 
the nonfed, it seems appropriate to 
employ one-tailed significance tests 
throughout (5). Figure 1 shows the 
performance of the two groups in 
terms of “Blodgett errors” as defined 
above. The t test for matched groups 
on the test run is 2.01 (.02 < p < .05). 
For eight of the matched pairs the fed 
were superior to the nonfed, while for 
two pairs the reverse obtained. The 
other five pairs were equal. It should 
be noted that 28 of the 30 Blodgett 
scores lie in the narrow range of scores 
from 1 to 3, so this test is perhaps not 
very sensitive. 

[Fig. 2. Summed weighted errors, matched groups.] 

A more sensitive measure of actual 
performance is shown in Fig. 2, plot- 
ting the daily “weighted errors” as 
defined above. Here also the t test is 
2.01 (.02 < p < .05). Inspection of 
the distributions shows that 73% of 
the nonfed exceed the median error 
score of the fed, and 73% of the latter 
fall below the former’s median. One- 
third of the nonfed made more errors 
than any of the fed. Of the 15 
matched pairs, 10 show a difference 
score favoring the fed rat, 1 pair is 
equal, and 4 show the nonfed superior. 
Examining these discrepancies by 
means of Wilcoxon’s nonparametric 
matched pairs signed rank test (10, 
p. 126) we find a T of 27.5 as the sum 
of ranks with less frequent sign, con- 
trasted with a chance expectancy of 60 
(p < .05; 18, p. 120). The median 
test, which does not take advantage 
of the experimental matching, yields 
a p between .02 and .05 (10, p. 125). 
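
The signed rank computation can be sketched as follows. The difference scores are hypothetical, not the experimental data, and dropping zero differences and averaging tied ranks are conventions assumed for the sketch; the text does not say how the one tied pair was handled. The chance expectancy n(n + 1)/4 equals 60 when n = 15, which agrees with the value quoted above.

```python
def signed_rank_sums(differences):
    """Return (positive_sum, negative_sum) of the ranks of nonzero differences.

    Zero differences are dropped, and tied absolute values share the mean of
    the ranks they occupy. Both conventions are assumptions of this sketch.
    """
    ordered = sorted((d for d in differences if d != 0), key=abs)
    positive_sum = negative_sum = 0.0
    i = 0
    while i < len(ordered):
        j = i
        while j < len(ordered) and abs(ordered[j]) == abs(ordered[i]):
            j += 1
        mean_rank = (i + 1 + j) / 2  # mean of the ranks i+1, ..., j
        for d in ordered[i:j]:
            if d > 0:
                positive_sum += mean_rank
            else:
                negative_sum += mean_rank
        i = j
    return positive_sum, negative_sum


def chance_expectancy(n_pairs):
    """Expected value of either rank sum when signs fall at random."""
    return n_pairs * (n_pairs + 1) / 4


# Hypothetical difference scores (nonfed minus fed), for illustration only.
diffs = [3, -1, 2, 5, -4, 6]
pos, neg = signed_rank_sums(diffs)
T = min(pos, neg)  # here the negative sign is the less frequent one
print(T, chance_expectancy(len(diffs)))
print(chance_expectancy(15))  # 60.0
```

The further T falls below its chance expectancy, the stronger the evidence that the differences favor one member of the pair.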

An analysis of the individual rats’ 
gains in weighted errors between the 
last prefeeding run and the test run 
was almost exactly at p = .05 (t = 1.73, 
14 df) for Blodgett errors, and between 
the 3 and 4% levels of confidence for 
weighted errors (t = 1.996). Wil- 
coxon’s test again yields a sum of 27.5 
(p < .05). 

The effect of the extra-maze feeding 
upon running times was somewhat 
more striking. Seventy-three per cent 
of the nonfed exceed the median 
running time of the fed, and all of the 
fed fall below the nonfed median. 
The mean running time for the two 
groups differed by 25.2 sec. (36.1 for 
the fed, 61.3 for the nonfed). A t test 
(matched formula) yields t = 3.62 
(p < .01). Wilcoxon’s test shows a 
T = 10 instead of the theoretical 60, 
which is at the 1% level of confidence. 
It seems appropriate to conclude that 
the extra-maze feeding exerted a sig- 
nificant error-reducing and speeding- 
up effect upon maze performance 
during the next day’s run. 
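
A matched-pairs t of the kind reported here is the mean of the pairwise differences divided by its standard error. The sketch below shows that computation; the running times in it are hypothetical, and only the group means in the text (61.3 and 36.1 sec., a difference of 25.2 sec.) come from the experiment.

```python
import math


def matched_t(first, second):
    """Return (t, df) for paired scores, testing the mean of second - first."""
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two equally long lists with at least two pairs")
    diffs = [b - a for a, b in zip(first, second)]
    n = len(diffs)
    mean_diff = sum(diffs) / n
    # Unbiased variance of the differences (n - 1 in the denominator).
    var_diff = sum((d - mean_diff) ** 2 for d in diffs) / (n - 1)
    if var_diff == 0:
        raise ValueError("all differences are identical; t is undefined")
    return mean_diff / math.sqrt(var_diff / n), n - 1


# Hypothetical running times in seconds for four matched pairs.
fed = [30, 40, 35, 45]
nonfed = [50, 55, 60, 70]
t, df = matched_t(fed, nonfed)
print(round(t, 2), df)
```

Because each rat is compared only with its matched partner, variation between pairs does not enter the error term, which is the advantage of the matching.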


SUMMARY AND CONCLUSIONS 


Hungry rats were run in the six-unit 
Blodgett maze one trial per day for 
five days, no food reward being given. 
During this period they showed a 
reduction in errors which had leveled 
off by the fifth day. Two groups were 
then formed by matching individual 
rats by pairs in terms of several 
measures of their performances to that 
time. On the following day none of 
the subjects ran the maze, but were 
placed in a familiar mazelike box 
located several feet above the maze 
and behind the maze end opposite the 
goal box. One group fed in this 
box on that day, the other did not. 
On the following day’s run the fed 
rats were significantly superior to the 
nonfed in both time and errors. The 
results were those predicted from a 
hypothesis that the Blodgett effect 
is at least partly due to conditioning 
of hunger drive to the exteroceptive 
stimuli of the maze situation. 


(Received July 10, 1952) 


NUMBER OF COMMON ELEMENTS AND CONSISTENCY OF 
REINFORCEMENT IN A DISCRIMINATION LEARNING TASK 1 


ROBERT STANTON FRENCH 
Perceptual and Motor Skills Research Laboratory, Human Resources Research Center 


In this study discrimination learn- 
ing is investigated as a joint function 
of (a) the degree to which stimuli 
within a group are similar, and (b) 
the consistency of the reinforcement 
of a given response to these similar 
stimuli. These variables were chosen 
because it is believed that in their 
interaction they constitute primary 
determinants of learning. Results 
reported from a rapidly expanding 
area of research have given abundant 
confirmation to the proposition that 
with an increase in number of common 
elements, or an increase in the proxim- 
ity of stimulus values along either a 
physical or response-defined dimen- 
sion, there is almost invariably an 
increase in frequency of occurrence of 
a common response. 

Equally well established is the prin- 
ciple that a more difficult task con- 
fronts the individual when a common 
response is not reinforced or other- 
wise encouraged, but instead a number 
of different responses are required. 
Simply stated, the learner must ignore 
the obvious similarities between the 
stimuli while discovering in what 
respects the stimuli may be differenti- 
ated. Clearly, as the number of dif- 
ferent responses required increases, or 
as the proportion of similar stimuli to 
be associated with each of two classes 
of response approaches one-half, the 
difficulty of the task should increase 
proportionately. Thus, task difficulty 
should always increase when consis- 
tency of reinforcement, defined in 
either of the above ways, decreases 
(assuming a degree of similarity 
among the stimuli great enough to 
produce generalization). 

1 The experimental work for this study was 
performed as part of the United States Air Force 
Human Resources Research and Development 
Program. The opinions or conclusions contained 
in this report are those of the author. They are 
not to be construed as reflecting the views or 
indorsement of the Department of the Air Force. 

One cannot, however, describe the 
effect of varying stimulus similarity 
with such a simple statement, since 
there is every reason to believe that 
the effect of varying similarity will be 
markedly different depending upon the 
consistency of reinforcement. Con- 
sidering the extremes, if a common 
response is reinforced for all of a 
group of similar stimuli, increasing 
the degree of this similarity clearly 
simplifies the task. On the other 
hand, if two classes of responses are 
involved, and if half of the similar 
stimuli are to be associated with each 
response, increasing the degree of 
stimulus similarity should have the 
opposite effect; likewise, intermediate 
effects should be produced by inter- 
mediate levels of the two variables. 
Note that a set of converging func- 
tions is being described with stimulus 
similarity on the abscissa, and con- 
sistency of reinforcement as a param- 
eter. According to the hypothesis, as 
the degree of stimulus similarity 
decreases, the range of effect produced 
by changes in consistency of reinforce- 
ment decreases likewise, until finally 
with little or no similarity among the 
members of the “class” of stimuli, 
variations in consistency of reinforce- 
ment produce no measurable changes 
in performance. 

Perhaps the largest share of research 
concerned with stimulus similarity in 
its relation to learning has been in the 
area of transfer of training. The 
resemblance of the present hypothe- 
sized interaction to the change in 
transfer effect from positive to negative 
under varied stimulus and response 
similarity conditions should be evi- 
dent. We shall consider in detail in a 
later section the theoretical implica- 
tions of this apparent equivalence. 

In the area of discrimination learn- 
ing and concept formation there has 
been perhaps less systematic manipu- 
lation of stimulus similarity and con- 
sistency of reinforcement. A typical 
problem of discrimination learning 
involves the discrimination between 
two classes of stimuli with only one 
of the classes “correct” or associated 
with reward. If a common response 
(perhaps choosing a form or object 
either literally or symbolically) is rein- 
forced for all of the members of one 
of these classes of stimuli (and by im- 
plication a common response of avoid- 
ance is reinforced for the other class), 
then the consistency of reinforcement 
is clearly maximum. Perhaps the 
simplest and most effective way of 
reducing this consistency by degrees 
to a minimum is to interchange the 
reinforcement or reward value of an 
increasing proportion of the stimuli 
from the two classes until half of each 
class is correct and half incorrect. 
For present purposes this latter con- 
dition defines minimum consistency 
of reinforcement. 

Certain aspects of the present ex- 
perimental hypothesis were tested by 
French (2). In his study the stimuli 
consisted of various combinations and 
permutations of four-letter aggregates 
of consonants. Degree of stimulus 
similarity was varied by manipulating 
the number of common letters within 
each of two classes of stimulus items. 
In addition to five degrees of stimulus 
similarity, two experimental condi- 
tions were contrasted which in their 
assignment of correct and incorrect 
stimulus items duplicated exactly the 
conditions described as maximum and 
minimum consistency of reinforcement 
for the present study. The possibility 
of intermediate degrees of consistency 
was not appreciated at the time. In 
brief, the results of this study clearly 
substantiated the hypothesized inter- 
action. 

There seems little reason to doubt 
that these two independent variables 
play a major role as determinants of 
learning, at least in the kind of dis- 
crimination task described. The pres- 
ent study is a repetition of the first 
in some respects, and as such can offer 
important corroboration. In addi- 
tion, however, the present study 
permitted measurement of response 
latencies as well as errors, investigated 
intermediate values of consistency of 
reinforcement, and utilized abstract 
forms rather than letters. The use of 
abstract forms would seem to offer 
greater possibilities of generalizing the 
results to a variety of configurations 
than is possible with verbal materials. 

By way of summary, the experi- 
mental hypothesis to be investigated 
may be restated briefly: It is proposed 
that discrimination learning perform- 
ance is a joint function of (a) the 
number of common stimulus elements 
within a group of forms, and (b) the 
proportion of these similar forms for 
which the response of “choice” is 
reinforced or indicated as “correct.” 
According to the hypothesis, with 
high consistency of reinforcement, in- 
creasing the number of common ele- 
ments decreases the difficulty of 
the discrimination task progressively. 
However, as the consistency of rein- 
forcement is reduced, the function de- 
scribing decreasing difficulty changes 
slope progressively and is finally 
replaced by a function describing 
increasing difficulty. Thus, in effect, 
an interaction between the two inde- 
pendent variables is hypothesized. 


METHOD 


Apparatus.—In order to test the hypothesis 
under consideration a technique was needed for 
presenting pairs of forms repeatedly in a varied 
order to a group of Ss, with measurement of both 
the kind of response and its latency. These 
needs were met through the use of a projection 
apparatus with electric controls so arranged that 
four clocks started simultaneously at the mo- 
ment of presentation of each stimulus pair upon 
the screen. Each of four Ss, seated in a row 
before the screen, had a small covered box on 
the arm of his chair. This box contained two 
microswitch buttons. The S was instructed to 
indicate his choice by depressing the button on 
the side corresponding to the position of one of 
the stimuli appearing side by side on the screen. 
Depression of either key stopped the correspond- 
ing clock and illuminated one of two colored 
lights on E’s control panel. Six seconds after 
the pair of stimuli appeared on the screen (an 
interval sufficiently long that all Ss had made a 
choice), E threw a switch illuminating a small 
red light either on the right or left side of the 
screen. This light indicated to the Ss which of 
the two stimuli was the correct choice for that 
particular pair. This indicator light, along with 
the two stimuli, remained on for 6 sec., allowing 
time for a careful examination of the forms. 
Thus the stimuli were presented for a total of 
12 sec. with an indication of the correct choice 
after 6 sec. The interval of time between trials 
was held at approximately 15 sec. This general 
procedure was used for all of the conditions of the 
experiment. The experimental setup required 
two Es. One experimental assistant operated 
the slide projector while another recorded the 
responses and their latencies on prepared data 
sheets. 

Subjects.—The Ss were 384 airman basic 
trainees at Lackland Air Force Base. Twenty- 
four Ss were assigned to each of the 16 primary 
experimental conditions. 

Each group of Ss was given general instruc- 
tions in the operation of the response keys. The 
Ss were told that their task was to make as many 
“correct” choices as possible as indicated by the 
position of the red light. No mention was made 
of the possibility of identical segments in the con- 
tours successively presented. Thus, the task 
involved the discovery of the basis for a “correct” 
choice as well as the fixation of this response 
through repeated reinforcement. 


Stimuli.—The stimulus forms employed in 
this study are shown in Fig. 1. Although six 
forms are illustrated as constituting each arbi- 
trary group or class, actually 12 2 X 2-in. slides 
were constructed for each condition. Six of 
these slides reversed the right-left position of the 
forms as they appeared on the screen. 

Figure 1 illustrates the manner in which the 
number of common segments was varied through 
four degrees. Within each of these degrees the 
“correct” (plus) and “incorrect” (minus) groups 
are indicated. The particular stimulus forms 
always paired together are indicated within each 
of the degrees by the successive rows. Thus for 
each degree, six pairs of stimulus forms are shown. 
In Fig. 1 consistency of reinforcement is held 
constant at 100%. 

The general principles followed in the con- 
struction of the forms may be described with 
reference to the zero common-segment condition. 
Each of the forms in this condition, and through- 
out all of the conditions of the experiment, is 
composed of some three-segment combination of 
18 basic segments joined to form a closed con- 
tour. These basic segments were constructed 
by asking a group of Ss unfamiliar with the 
experiment to draw any continuous irregular line 
they wished between two specified points. (These 
two points represented a distance equivalent to 
one-third of the circumference of a circle of given 
size.) From these 18 basic segments three-seg- 
ment combinations were chosen to form two 
groups of six forms each. Certain principles 
were followed in the choice of segments for each 
of these two groups. Each of the 18 segments 
appears once in each group but never in the same 
three-segment combination. This balancing 
eliminates any obvious communality in one 
group not present in the other as well. Note also 
that the designation “zero common segments” is 
valid only with reference to the six-member 
groups of forms individually, and not with refer- 
ence to pairs of such groups. 

In the condition specified as “one common 
segment” each group is characterized by the 
possession of a common segment unique to that 
group. This common segment is indicated 
at the side of each group. Note that the com- 
mon segment appears at a different compass 
orientation within each of the six forms. The 
communality or homogeneity within the group 
of forms is increased successively by the intro- 
duction of a second and then a third common 
segment. The common segments and their junc- 
tures are shown at the side of each group. With 
the addition of a third common segment the 
forms within each group differ among themselves 
only in degree of rotation about the transverse 
axis. It may be noted that as the number of 
common segments within each group is increased 
successively, certain segments which were bal- 
anced between the two paired groups drop out 
altogether. This procedure must be followed 
if the number of common segments is to be 
increased without increasing the total number 
of segments within each form. Since this pro- 
cedure of replacing “noncommon” segments 
with common segments was employed through- 
out the study, the variable represented may be 
thought of as a parameter. 

Fig. 1. Stimulus forms arranged to illustrate the increase in common segments from zero to 
three within paired six-member groups of forms. Plus and minus symbols are used to indicate the 
correct and incorrect stimuli. The common segments and their junctures are indicated at the side 
of each group. 


TABLE 1 

ERROR MEANS AND STANDARD DEVIATIONS 
(N in each group = 24) 

                           Number of Common Segments 
Consistency of        0               1               2               3 
Reinforcement     Mean    SD      Mean    SD      Mean    SD      Mean    SD 

    0%           26.88  13.20    31.29  11.85    39.63  12.34    46.83   6.55 
   33%           26.96  10.24    27.58  10.49    28.00  12.24    38.50   8.38 
   67%           28.10  12.55    15.90   9.67    20.33   8.22    22.75  10.94 
  100%           24.42  11.48     4.92   5.38     3.63   6.59     0.90   1.05 

For purposes of this study, consistency of rein- 
forcement was varied in four degrees by inter- 
changing the “reinforcement values” of a certain 
proportion of the forms from the paired groups. 
To produce variation in consistency of reinforce- 
ment an identical procedure was followed for 
each of the four similarity conditions. For 100% 
consistency the six forms containing the same 
segments were all correct; likewise the paired six 
forms were all incorrect. Sixty-seven per cent 
consistency was defined as the condition in which 
five out of six of the similar forms were correct 
and thus one out of the six was incorrect.2 For 
33% consistency the reinforcement values of two 
out of the six forms were interchanged; finally, 
with 0% consistency half of the original group 
of similar forms was given opposite reinforce- 
ment. (Note that per cent consistency was 
actually the difference between the per cent 
correct and the per cent incorrect.) The zero 
common-segment condition constitutes a control. 
Although there is no “group of similar forms” 
(necessary if the concept of consistency is to be 
meaningful) the reinforcement values of specified 
proportions of forms were interchanged in a 
manner identical to the other similarity condi- 
tions to test for the hypothesized convergence 
of the functions. 

2 A possible confusion of the present design 
with “partial reinforcement” procedures should 
be clarified. The present use of “inconsistency” 
does not refer to a change in the reinforcement 
value of any form during the presentation period 
for a given group of Ss. Rather, the inconsis- 
tency lies in the fact that when the same response 
(choice) is made to a group of similar forms, this 
response for certain specific forms was never 
reinforced. 
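
The four consistency levels follow from the stated rule that per cent consistency is the per cent correct minus the per cent incorrect within a group of six similar forms. A minimal sketch of that arithmetic, with a hypothetical function name:

```python
def percent_consistency(n_correct, n_forms=6):
    """Per cent correct minus per cent incorrect among the similar forms."""
    if not 0 <= n_correct <= n_forms:
        raise ValueError("n_correct must lie between 0 and n_forms")
    n_incorrect = n_forms - n_correct
    return 100 * (n_correct - n_incorrect) / n_forms


# Six, five, four, and three of the six similar forms designated correct.
for n_correct in (6, 5, 4, 3):
    print(n_correct, round(percent_consistency(n_correct)))
# 6 100
# 5 67
# 4 33
# 3 0
```

Interchanging one pair therefore lowers consistency by a third of the scale, and the scale reaches zero when the similar forms are split evenly between correct and incorrect.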

Procedure.—In terms of the primary inde- 
pendent variables the experiment was factorial 
in design, i.e., each level of one variable was 
tested with each level of the other variable. In 
all, 16 experimental conditions were involved, 
since the two independent variables were each 
manipulated in four degrees. 

The testing procedure consisted of 96 slide 
presentations. Each of 12 slides was shown 
eight times, or each pair 16 times. The 12 slides 
appeared in a different order in eight consecutive 
12-trial blocks according to a prearranged order 
of presentation. The sequence of positions for 
the correct form of each pair (the correct form 
was indicated by the position of a signal light) 
was predetermined such that the number on 
the right and left was equated and systematic 
sequences were avoided. With this arrangement 
the pairs of forms with interchanged reinforce- 
ment values, i.e., the “inconsistent” instances, 
appeared twice in each successive block of 12 
trials (once on the right and once on the left). 
However, the sequence of trials was arranged to 
avoid systematic “massing” or “spacing” of 
these inconsistent instances. 
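
The counts given for the design can be checked by enumerating it. The figures below are those stated in the text (four levels of each variable, 24 Ss per condition, 12 slides shown in eight blocks, six pairs of forms); the variable names are ours.

```python
from itertools import product

common_segments = (0, 1, 2, 3)
consistency_pct = (0, 33, 67, 100)

# Factorial design: every level of one variable with every level of the other.
conditions = list(product(common_segments, consistency_pct))

subjects_per_condition = 24
slides = 12   # six pairs, each in both right-left arrangements
blocks = 8    # each block shows all 12 slides once
pairs = 6

total_subjects = len(conditions) * subjects_per_condition
total_trials = slides * blocks
showings_per_pair = total_trials // pairs

print(len(conditions))    # 16
print(total_subjects)     # 384
print(total_trials)       # 96
print(showings_per_pair)  # 16
```

These totals agree with the 16 conditions, 384 Ss, 96 presentations, and 16 showings of each pair reported in the text.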

The assignment of Ss to the various experi- 
mental conditions could not be completely ran- 
dom since four Ss were tested at one time. The 
Ss were assigned in such a way, however, that 
systematic differences due to individual differ- 
ence variables were balanced among the primary 
conditions of the experiment. 


RESULTS 


Tables 1 and 2 show the mean total 
error and mean response latency under 
each of the experimental conditions. 
Essentially the same data are shown 
in graphic form in Fig. 2 and 3. 


TABLE 2 

LATENCY MEANS AND STANDARD DEVIATIONS (IN SECONDS) 
(N in each group = 24) 

                           Number of Common Segments 
Consistency of        0               1               2               3 
Reinforcement     Mean    SD      Mean    SD      Mean    SD      Mean    SD 

    0%            2.09   0.37     2.09   0.37     1.93   0.48  [illegible] 0.33 
   33%            1.89   0.37     1.99   0.44     1.87   0.39     2.06   0.60 
   67%            2.14   0.47     1.52   0.42     1.36   0.50     1.41   0.43 
  100%            1.86   0.29     1.04   0.43     0.83   0.27     0.70   0.12 


The close approximation of the 
results to those hypothesized is imme- 
diately apparent. A set of converging 
functions was obtained which repre- 
sents the interaction between number 
of common segments and consistency 
of reinforcement. With high simi- 
larity between the forms of a class 
(three common segments) variations 
in consistency of reinforcement from 
0% to 100% are seen to vary discrimi- 
nation-learning performance, as meas- 
ured by mean total error in 96 trials, 
from 46.83 (slightly above chance) to 
.9. This is almost the entire range of 
Fig. 2. Mean total errors for the 96-trial testing
period as a function of number of common
elements and percentage consistency of reinforcement


learning performance possible under 
the present conditions. As the simi- 
larity within the class of forms is 
decreased, the range of performance is 
seen to decrease progressively. Fin- 
ally, with “no” similarity within the 
class of forms (zero common seg- 
ments), variations in consistency of 
reinforcement produce what are found 
to be chance differences in perform- 
ance error. For the latency measure 
the picture is somewhat less clear for 
the 0% consistency condition. Pos- 
sibly the group testing situation and 
the limitation on response time to 6 
sec. are responsible for the “ceiling” 
[Fig. 3. Mean response latency as a] function of number of common elements and
percentage consistency of reinforcement. The
latencies for each S were averaged for the 96-trial
testing period and the mean of this averaged
value for each of the 24 Ss obtained.
at about 2 sec. apparent in the data of
Fig. 3. 

Separate analyses of variance were 
performed for the error and latency 
data. In brief, for both the error and 
latency analyses number of common 
segments, consistency of reinforce- 
ment, and the interaction of these 
variables were all highly significant
(p < .001).³

In lieu of separate t-test compari- 
sons, a single confidence interval at 
the .05 probability level was estab- 
lished for the error and latency data 
separately. As an estimate of the 
universe variance (s²) the mean square
for Ss from the corresponding analysis 
of variance was used. For the error 
data, a difference between means of 
5.76 or more was determined to be 
significant at the .05 confidence level. 
Thus, as may be seen from Table 1, 
with three common segments the 
smallest mean difference is significant 
at this level while with zero common 
segments the largest mean difference 
does not achieve significance. For 
the latency data, a difference between 
means of .24 was determined to be 
significant. From Table 2 it is seen 
that at the three common-segments 
level the smallest mean difference is 
significant while at the zero common- 
segments level two comparisons just 
achieve significance. The conclusion 
is justified, therefore, that both in the 
over-all analysis and for more detailed 
comparisons the experimental hy- 
pothesis has been substantiated un- 
equivocally. 
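
The critical-difference comparison described above can be retraced from the printed latency means. The following is a minimal sketch in Python and is not part of the original analysis: the function name `significant_pairs` is hypothetical, the means are the zero-common-segments latency column of Table 2, and .24 is the critical difference reported in the text.

```python
from itertools import combinations

# Latency means (sec.) at zero common segments, from Table 2,
# keyed by percentage consistency of reinforcement.
ZERO_SEGMENT_MEANS = {0: 2.09, 33: 1.89, 67: 2.14, 100: 1.86}
CRITICAL_DIFFERENCE = 0.24  # value reported in the text for the latency data


def significant_pairs(means, critical):
    """Return the pairs of conditions whose means differ by at least `critical`."""
    pairs = []
    for (a, mean_a), (b, mean_b) in combinations(means.items(), 2):
        # Round to two decimals (the precision of the table) to avoid float noise.
        if round(abs(mean_a - mean_b), 2) >= critical:
            pairs.append((a, b))
    return pairs


pairs = significant_pairs(ZERO_SEGMENT_MEANS, CRITICAL_DIFFERENCE)
print(pairs)
```

Under these assumptions exactly two comparisons (33% vs. 67% and 67% vs. 100%) reach the criterion, which agrees with the statement that two comparisons just achieve significance at the zero common-segments level.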


3 It is evident from an examination of Tables 1 
and 2 that with such a wide range of performance 
an appreciable correlation between the means 
and SD’s is present; therefore, the assumption of 
homogeneity of variance is quite likely not satisfied.
In view of the extremely high F ratios for
consistency of reinforcement (in the error anal- 
ysis 133; in the latency analysis, 104; with df in 
both cases, 3/368) the question of homogeneity 
of variance appears to be somewhat academic. 


DISCUSSION


The effect of increasing the number 
of common segments within a group of 
stimulus forms has been shown to be a 
function of the proportion of these 
stimuli designated “correct” relative 
to the proportion designated “incorrect.”
The present study would seem
to provide an experimental framework 
for a systematic approach to a vari- 
ety of discrimination-learning and 
concept-formation tasks. In addition, 
the results would seem to have theo- 
retical implications for both discrimi- 
nation-learning and transfer-learning 
situations in which the communality 
within groups of stimuli and the con- 
sistency of reinforcement relative to 
these groups may be specified or 
varied. 


On a molecular level the concept of stimulus 
generalization may be used to account for the 
results. According to the present view, the 
increase in the number of common segments 
may be assumed to have increased the amount of 
stimulus generalization among the stimuli. 
When the entire group is “correct,” such generalization
is undoubtedly facilitative in its effect.
Such facilitation would certainly be suggested 
by the small number of trials required to learn 
the “correct” forms with three common segments 
and 100% consistency of reinforcement. With 
less than 100% consistency it appears reasonable 
to suppose that the trials on which some propor- 
tion of the original group of stimulus forms are 
presented as “incorrect” represent, in a sense, 
experimental extinction of the response which 
has been made to the majority of the stimuli. 
Thus, as conceived here, variation in consistency 
of reinforcement changes the relative weights of 
a facilitative and an inhibitive factor. This 
inhibitive factor is assumed to generalize among 
all of the group just as the facilitative factor 
does. Accordingly, the probability of a “cor- 
rect” response is conceived to be, in part, a func- 
tion of the algebraic summation of these opposing 
effects, each weighted according to its proportional
representation.⁴ As the proportion of


4 It is not clear to what extent the specific
concepts employed by Hull and his associates 
are applicable to the present discrimination- 
learning situation. In general, however, the 
present formulation appears consistent with rein- 
“correct” and “incorrect” stimuli approaches
one-half, the magnitude of the inhibitive factor 
is assumed to approach that of the facilitative. 
One would expect task difficulty to increase, 
therefore, whenever consistency of reinforcement 
is decreased, assuming some generalization among 
the stimuli in question. 
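
One way to make the "algebraic summation" concrete is the illustrative Python sketch below. It rests on assumptions that the text does not state: that facilitation and inhibition have the same unit strength, and that each is weighted simply by the proportion of "correct" or "incorrect" stimuli in the group. The name `net_tendency` and the parameter `unit_strength` are hypothetical.

```python
def net_tendency(p_correct, unit_strength=1.0):
    """Facilitation minus inhibition, each weighted by its proportion.

    p_correct is the proportion of stimuli in the group designated "correct";
    the remainder (1 - p_correct) is treated as extinction of the response.
    """
    if not 0.0 <= p_correct <= 1.0:
        raise ValueError("p_correct must lie between 0 and 1")
    facilitation = p_correct * unit_strength
    inhibition = (1.0 - p_correct) * unit_strength
    return facilitation - inhibition


for p in (1.0, 0.75, 0.5):
    print(f"p = {p:.2f}  net = {net_tendency(p):+.2f}")
```

In this sketch the net tendency falls to zero as the proportion approaches one-half, which is the limiting case the paragraph above describes.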

The interaction between the primary variables 
is revealed by the fact that the amount of change 
in task difficulty with variation in consistency 
of reinforcement is itself a function of degree of 
similarity. One possible explanation for this 
interaction is in terms of an analysis of the total 
habit strength for each stimulus form into pro- 
portions associated with unique and common 
elements. The probability of a response to such 
a complex stimulus is conceived to be a function 
of some sort of summation of individual habit 
strengths for elements within the form. It is 
assumed that only the common elements produce 
generalization to any measurable extent. It is 
reasonable, therefore, that only the proportion 
of the total habit strength associated with 
common elements should be affected by variation 
in consistency of reinforcement. As this propor- 
tion is decreased, the effect of variation in con- 
sistency of reinforcement is decreased likewise. 
In the present study the proportion of unique to 
common elements was varied directly, since the 
total number of elements was held constant. It 
should be evident, however, that wherever there 
is an increase in communality or similarity 
among a group of stimuli, either in the laboratory 
or in nature, there is ordinarily a concomitant 
decrease in uniqueness or individuality. Even 
where the unique elements are not actually 
eliminated, they seem to become less “distinc- 
tive” as more and more common elements are 
added. An explanation of the interaction in 
these terms, therefore, would seem to have con- 
siderable generality. 

Assuming the present formulation in terms of 
stimulus generalization adequately accounts for 
the results, one might expect to get a similar set 
of interacting functions wherever degrees of 
stimulus generalization may be specified or 
varied with concomitant variation in consistency 
of reinforcement. It is not at all difficult to find 
examples in the literature of variations in stim- 
ulus similarity with results comparable to the 
present study; it is difficult, however, to find 
cases in which stimulus similarity and a variable 
comparable to consistency of reinforcement were 
manipulated at the same time. In the discus- 
sion which follows an attempt will be made to 
integrate the present approach with some of the 


forcement theory, i.e., Hull (5), and specifically 
with the analysis of discrimination learning in 
animals outlined by Spence (8, 9). 


alternative formulations which have been 
proposed. 

A number of theoretical integrations of the 
relationship of stimulus and response similarity 
to transfer effects have been formulated. The 
formulations of Bruce (1), Gibson (4), Osgood 
(7), Underwood (10), and Gagné, Baker, and 
Foster (3) are perhaps the most complete and 
most well known. Though not always described 
in such terms, to the present writer each of these 
formulations appears to be concerned, at least in 
part, with the interaction between stimulus simi- 
larity and a variable most frequently identified 
as response similarity. Let us examine this 
latter variable more closely. Obviously the 
response of S cannot be an independent variable 
manipulated directly by E. Nor is it likely that 
the writers using this term in connection with the 
paired-associates procedure conceive of response 
similarity in this manner. It would most likely 
be agreed that the so-called response similarity 
variable is actually variation in the response 
characteristics which are considered “correct” or 
which are otherwise encouraged. When con- 
sidered in this sense, response similarity is clearly 
a reinforcement variable and may be manipulated 
as such. By changing response similarity in the
transfer task, E can change the degree to which 
the original response is still reinforced, or, in a 
sense, the proportion of the original response 
which is reinforced relative to the proportion 
which is extinguished (assuming response simi- 
larity has been quantified). 

In the present discrimination task the mem- 
bers of a class of similar stimuli are presented 
repeatedly and consistency of reinforcement is 
defined in terms of the proportion of the stimuli 
for which the same response is reinforced. In 
the transfer situation, on the other hand, using 
the paired-associates procedure, each class of 
similar stimuli is ordinarily represented by a 
single stimulus item in the original and transfer 
lists. If the response items are identical for cor- 
responding stimulus items of the two lists, in 
effect, 100% of the original response is reinforced. 
If the material employed permits quantification 
in terms of number of common elements, the 
proportion of the original response which receives 
reinforcement in the transfer list may be decreased 
by degrees, i.e., response similarity may be de- 
creased. According to the present view this 
variation in the proportion of the response rein- 
forced produces, in terms of intervening process, 
an effect equivalent to variation in consistency of 
reinforcement as defined here. 

When similarity has been defined in terms of 
relative proximity of stimulus values (ordered 
along either a physical or response-defined 
dimension), it is perhaps less meaningful to 
speak of “the proportion of the original response 
which is reinforced.” In a recent article, Morgan
and Underwood (6) proposed that the effect of 
varying response similarity might be accounted 
for in terms of the concepts of response generali- 
zation and “parasitic reinforcement.” Accord- 
ing to these authors, at the same time that a 
given response is reinforced in the first list there 
is, in effect, “parasitic reinforcement” of similar 
responses. It is assumed that this reinforcement 
decreases in degree as the generalized response is 
less similar to the one actually reinforced. They 
reason that one would expect an increase in 
response similarity to increase the amount of 
associative facilitation, since some of the response 
items may already have corresponding response 
tendencies at a “supra-threshold” level. The 
explanation offered by Morgan and Underwood 
is essentially the same as that suggested here in 
terms of consistency of reinforcement. Accord- 
ing to the present formulation, however, when 
the original response is reinforced, in a sense, 
connections are strengthened between “elements” 
(or dimensional values) of both stimulus and 
response. To the extent that the transfer task 
duplicates these stimulus “elements,” the corre- 
sponding response “elements” will at least have 
a greater-than-chance probability of being 
evoked. The degree of associative facilitation 
or inhibition in the transfer task, therefore, is 
considered to be a function of the proportion of 
response “elements” which are reinforced rela- 
tive to the proportion extinguished. 

As further evidence for the view that response 
similarity and consistency of reinforcement are 
functionally equivalent, it may be shown, for 
example, that the “transfer and retroaction 
surface” proposed by Osgood (7) may be trans- 
lated directly into the form illustrated by Fig. 2 
in the present study. To make this translation 
it is only necessary to substitute consistency of 
reinforcement for response similarity and to take 
the point of convergence of the functions at zero 
common segments as the base line from which neg- 
ative and positive transfer effects are determined. 

Since the implications of the present study go 
far beyond the limits of the discrimination learn- 
ing task investigated here, the need is indicated 
for a separate theoretical discussion applying the 
concept of consistency of reinforcement both to 
original-learning and transfer-learning situations. 
The present study may be thought of as the 
empirical basis for this proposed theoretical 
formulation, the outlines of which have been 
sketched briefly. 


SUMMARY 


1. Using a discrimination-learning 
task with stimuli consisting of closed ir- 
regular contours, this study tested the 


hypothesis that discrimination-learn- 
ing performance is a joint function of 
(a) the number of common stimulus 
elements within a group of forms, and 
(b) the proportion of these similar 
forms for which the response of 
“choice” is reinforced or indicated as 
“correct.” According to the hypoth- 
esis, with high consistency of rein- 
forcement increasing the number of 
common elements decreases the diffi- 
culty of the discrimination task pro- 
gressively. As the consistency of 
reinforcement is reduced, however, the 
function describing decreasing diffi- 
culty changes slope progressively and 
is finally replaced by a function 
describing increasing difficulty. Thus, 
in effect, an interaction between 
the two independent variables was 
hypothesized. 

2. Each closed irregular contour 
was composed of three joined contour 
segments. Three-segment combina- 
tions of 18 basic segments were so 
arranged that the number of identical 
contour segments could be varied from 
zero to three within different groups 
of stimuli. In such groups, variation 
in four degrees was also introduced in 
the percentage of forms designated 
“correct” relative to the percentage 
designated “incorrect.” This latter 
manipulation defines variation in con- 
sistency of reinforcement. 

3. The discrimination task em- 
ployed was that of learning to choose 
on each trial the “correct” form in 
successively presented pairs of such 
forms. The forms in each pair were
drawn individually from each of two six-member
groups of forms within which
the number of common segments was 
varied. 

4. The procedure involved testing 
the discrimination performance for a 
96-trial testing period, with each of
the 16 possible combinations formed 
by the four possible degrees of variation
of the two independent variables.
Twenty-four airman basic trainees 
each were assigned randomly to each 
of the 16 experimental conditions. 

5. The results constitute an experi- 
mental verification of the hypothe- 
sized interaction between number of 
common segments and consistency of 
reinforcement at a high level of 
confidence. 

6. A theoretical formulation of the 
results in terms of stimulus generali- 
zation is outlined. The apparent 
equivalence of consistency of rein- 
forcement and “response similarity” 
is discussed. 


(Received July 11, 1952) 
STIMULUS GENERALIZATION AS A FUNCTION OF LEVEL
OF EXPERIMENTALLY INDUCED ANXIETY¹


GERALD ROSENBAUM 


Wayne University 


In recent years, the term anxiety 
has been used in theoretical interpre- 
tations of behavioral phenomena to 
mean a hypothetical state or response 
accompanying or resulting from the 
anticipation of noxious stimulation. 
According to this formulation, first 
proposed by Mowrer (15), an implicit 
anxiety response becomes conditioned 
to previously neutral cues which have 
been associated with painful stimula- 
tion. The conditioning of this inter- 
nal response is assumed to provide an 
acquired motivational state in that 
the presence of anxiety cues leads to 
more vigorous activity and escape 
from these cues serves to reinforce the 
learning of new responses. Animal 
studies have been conducted by Miller 
(13), Brown and Jacobs (5), and 
Amsel (1), among others, to show that 
anxiety does exhibit both the reinforc- 
ing and energizing properties of a 
drive. 

The energizing function of drives is 
embodied in Hull’s theory (10) by the 
assumption that relevant and irrele- 
vant drives combine to multiply the 
habit strength of stimulus-response 
connections. Miller has utilized this 
postulate in his theoretical analyses of 
conflict (12) and displacement (14) to 


1 This article is based on a dissertation submitted
to the faculty of the department of psychology
of the State University of Iowa in partial
fulfillment of the requirements for the Ph.D. 
degree. The writer wishes to express his grati- 
tude to Professors J. S. Brown and I. E. Farber 
for their guidance and to the staff of the Veterans 
Administration Hospital, Coatesville, Pennsyl- 
vania, for their cooperation during the course of 
this study. An abstract of this paper was pre- 
sented at the 1950 meeting of the Midwestern 
Psychological Association. 
predict that variations in drive level
will produce concomitant changes in 
the height of gradients of stimulus 
generalization. Since previous studies 
by Brown (3) and Rosenbaum (16) 
indicate that increases in the hunger 
drive result in a general elevation of 
performance measures of stimulus gen- 
eralization, similar results might be 
predicted for increases in the acquired 
anxiety drive. 

The present experiment was designed 
to investigate the hypothesis that stim- 
ulus generalization is a positive func- 
tion of level of experimentally induced 
anxiety. More specifically, the pre- 
diction was made, in accord with 
Hull’s postulates, that higher levels of 
anxiety would produce a greater 
strengthening of the response tendency 
to a conditioned stimulus and to 
stimuli differing from it along a given 
dimension. Performance measures of 
stimulus generalization would be ex- 
pected to reflect this heightened gra- 
dient of excitatory strength by an ele- 
vation in the gradients of response 
amplitude and frequency and by a 
lowering of the gradient of response 
latency. Although increases in drive 
should steepen as well as elevate the 
theoretical generalization gradient of 
excitatory strength, no precise predic- 
tion of the change in the slopes of the 
gradients of amplitude, frequency, and 
latency can be made from the func- 
tional relationships Hull postulates 
between these response measures and 
excitatory strength without specifying 
the prevailing habit strengths and 
amount of drive increase. Verifica- 
tion of the above hypotheses would 
extend the correspondence between
the notion of anxiety as an acquired 
drive and Hull’s general drive concept, 
as well as elaborating the relationship 
between different levels of anxiety and 
certain performance measures of stim- 
ulus generalization. 

In order to test these hypotheses, 
the following general procedure was 
employed. Human Ss were trained 
to make a simple motor reaction as 
quickly as possible upon the presenta- 
tion of a rectangularly shaped visual 
figure (conditioned stimulus). Other 
rectangles (generalized stimuli), which 
differed from the training stimulus in 
height and to which S was prohibited 
from responding, were then presented. 
The frequency, amplitude, and latency 
of “false” motor responses to these 
generalized stimuli were used as meas- 
ures of stimulus generalization. Three 
levels of anxiety were experimentally 
induced by training Ss to anticipate 
either a strong shock, a weak shock, 
or the sound of a buzzer when they 
responded slowly. It was assumed 
that Ss trained with more noxious 
stimuli would develop greater anxiety 
and exhibit a stronger tendency to 
respond to all stimuli presented. 


METHOD


Experimental design.—The experiment was
conducted in two parts, one with psychiatric 
patients, the other with normal college students. 
In the first part, 72 male psychiatric patients 
from the Veterans Administration Hospital at 
Coatesville, Pennsylvania were divided into three 
groups: (a) strong-shock, (b) weak-shock, and 
(c) buzzer. Each group consisted of 24 Ss 
matched for psychiatric diagnosis. The second 
part of the experiment was an exact replication 
of this design, using 72 male students from the 
introductory psychology course at the State 
University of Iowa. The replication was intro- 
duced in order to determine whether the results 
initially obtained with pathological Ss could be 
duplicated with normal groups. 

Apparatus.—The two major functional elements
of the apparatus were the stimulus-presentation
unit and the response key used by S. The


stimulus-presentation device was constructed 
in such a way that any one of four rectangular 
patches of light could be presented by E. It
consisted of a lamp box 8 in. high, 6 in. wide, and 
11 in. long, into one end of which was fixed an 
opal-flash glass panel (4.5 × 6.5 in.). A 60-w.
lamp was mounted inside at the opposite end of 
the box. An opaque panel, into which four rec- 
tangular slots had been cut, could be slid back 
and forth behind the glass screen. When one of 
the rectangular slots was at the center of the 
box, light would pass through it to the screen, 
forming an illuminated image of the slot. The 
four vertically oriented rectangular images were 
.5 in. wide and varied in height from 2.25 to 3 in. 
in steps of .25 in. A 13 × 28-in. plywood screen,
painted black and containing an aperture for the 
glass screen, was located in front of the lamp box 
to conceal E’s manipulation of the sliding panel 
from S. 

The response key consisted of a brass slider 
connected into an endless loop of string that 
passed in front of the lamp box and around to a 
movement-recording system at the rear. The 
slider was mounted on the same baseboard as the 
lamp box and located in front of the plywood 
screen. The Ss were instructed to grasp a knob 
(1 in. long and .5 in. in diameter) at the center 
of the slider and to make a simple movement to 
the right in the horizontal plane. The extent of 
the required movement was indicated by a white 
arrow painted on the response panel 7 in. from 
the starting point. The maximum possible 
movement of the slider was 9.25 in. The slider 
was connected through the endless loop of string 
to a wooden pulley in such a way that any move- 
ment of the slider caused the pulley to rotate. 
Two pointers were mounted on the axle of the 
pulley, one being rigidly attached to the axle, the 
other not. The second pointer would move only 
with the first and remain fixed at the termination 
of a movement, the first returning to the starting 
position with the slider. The amplitude of each 
movement could thus be read directly from the 
position of the second pointer on a calibrated 
scale. 

The latency of each response was recorded to 
the nearest .01 sec. by a Standard Electric timer. 
This timer and the 60-w. lamp in the stimulus- 
presentation unit were activated simultaneously 
by a doorbell-type switch in a holding-relay cir- 
cuit, so that the clock started at the same instant 
that the rectangular image was presented. 
Movements of the slider as small as .1 in. were 
sufficient to break the circuit to the clock and 
the lamp. The holding relay prevented reacti- 
vation of the timer and the lamp when the slider 
was returned to the resting position. If S did 
not respond within 5 sec., E could break the 
circuit by means of a separate switch. 
The shocks were generated by a Harvard inductorium
energized by 1.5 v., and were administered
through two copper electrodes fastened to
a leather wrist band. The movable secondary 
coil of the inductorium was set at 5.0 cm. for 
the strong-shock condition and at 6.5 cm. for the 
weak-shock. Electrode paste was used to reduce 
variations in resistance. The sound of the induc- 
torium when activated served as the buzzer. 

Preliminary training and screening.—The S
was seated at the apparatus so that his right arm 
rested on an adjacent table and the top of the 
glass screen was at eye level. For Ss tested 
under shock conditions, the wrist band was then 
strapped on to the left wrist. All Ss were shown 
the required movements of the slider and given 
several practice trials paired with the training 
stimulus. In order to screen Ss incapable of 
performing the experimental task, the four 
stimuli were presented in a systematic order and 
S was tested for his ability to discriminate be- 
tween them. If any of the other rectangles were 
designated as the training stimulus more than 
three times in the course of 20 trials, S was elimi- 
nated from the experiment. Although none of 
the normal Ss failed to meet this criterion, six 
psychiatric patients were eliminated. 

Training and testing for generalization.—All Ss 
were instructed to move the slider as rapidly as 
possible when the training stimulus was 
presented and were prohibited from responding 
to any of the other three stimuli which they were 
told would occasionally appear. Half of the Ss 
in each anxiety group were trained to respond to 
the 3-in. rectangle and the other half were trained 
with the 2.25-in. rectangle in order to control 
for differential effects of the extensity of the train- 
ing stimulus upon the generalization gradients. 

Following the reading of instructions, Ss were 
given an initial series of 20 conditioning trials 
with the training stimulus. The training trials 
were followed immediately by the test series, 
which consisted of five test trials on each of the 
three generalized stimuli in addition to the train- 


ing stimulus. These 20 test trials were inter- 
spersed among 60 additional training trials, so 
that each test trial was followed by three presen- 
tations of the training stimulus. The training 
stimulus thus appeared 65 times during the test 
series, but was recorded as a test trial only on 
the five occasions on which it was preceded and 
followed by three presentations of the same 
stimulus. Interspersed training trials were em- 
ployed to reduce extinction effects which might 
have been produced by the interpolation of gen- 
eralized stimuli. Twelve different orders of pres- 
entation of the test stimuli were used in each 
group to minimize the effects of serial position. 
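
The arithmetic of the test series (20 test trials, 60 interspersed training trials, 65 presentations of the training stimulus) can be checked with a short Python sketch. Two points are assumptions made only for illustration: the test stimuli are ordered by a seeded shuffle, whereas the experiment used twelve prepared orders, and the series is taken to open with a test trial. The name `build_test_series` is hypothetical.

```python
import random

TRAINING = 0                   # training stimulus (CS)
TEST_STIMULI = [0, 1, 2, 3]    # CS plus the three generalized stimuli
TEST_TRIALS_PER_STIMULUS = 5
FILLERS_AFTER_TEST = 3         # training presentations following each test trial


def build_test_series(seed=0):
    """Return a list of (stimulus, is_test_trial) tuples for one S."""
    rng = random.Random(seed)
    tests = TEST_STIMULI * TEST_TRIALS_PER_STIMULUS
    rng.shuffle(tests)  # assumed ordering; the study used twelve fixed orders
    series = []
    for stimulus in tests:
        series.append((stimulus, True))
        series.extend([(TRAINING, False)] * FILLERS_AFTER_TEST)
    return series


series = build_test_series()
n_trials = len(series)
n_training_stimulus = sum(1 for stim, _ in series if stim == TRAINING)
print(n_trials, n_training_stimulus)
```

Whatever the order of the test stimuli, the series contains 80 trials, of which 65 present the training stimulus (60 interspersed presentations plus its 5 test trials).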

The three levels of experimentally induced 
anxiety were effected by varying both the nox- 
ious stimulation and the instructions. Accord- 
ingly, Ss in each of the groups were told that if 
they responded too slowly they would receive 
the following treatments: 


Strong-shock group—receive a painful 
shock 

Weak-shock group—feel a slight skin 
stimulation 

Buzzer group—hear a buzzer sound 


Shocks or buzzes were administered once during 
the preliminary instructions and on the sixth 
and eighteenth training trials, regardless of S’s 
reaction latency, in order to maintain the par- 
ticular anxiety level involved. 


RESULTS 


The major results of the experiment 
obtained from measurements of the 
amplitude and frequency of responses 
are presented in Tables 1 and 2. 
These tables show the mean response 
tendency to each of the four stimuli 
for the normal and psychiatric Ss 
under the three experimental condi- 


TABLE 1 


Mean Amplitude of Response in Inches to the Training and Generalized Stimuli

                          Normal Ss                              Psychiatric Ss
                          Rectangles                             Rectangles
Group            0      1      2      3      Mean       0      1      2      3      Mean
Strong shock    7.12   4.20   3.80   3.62   68 [sic]   7.03   3.67   2.36   2.25   3.83
Weak shock      6.75   2.72   1.77   1.17   3.10       5.81   2.45   1.78   1.75   2.95
Buzzer          6.80   2.39   1.17   1.47   2.96       7.20   1.94   1.39   0.86   2.85
All Ss          7.07   3.10   2.25   2.09              6.68   2.69   1.84   1.62
TABLE 2


Mean Percentage Frequency of Responses to the Training and Generalized Stimuli

                          Normal Ss                              Psychiatric Ss
                          Rectangles                             Rectangles
Group            0      1      2      3      Mean       0      1      2      3      Mean
Strong shock    90.8   58.3   57.5   54.2   65.2       91.7   54.2   45.0   40.8   57.9
Weak shock      83.3   37.5   26.7   24.2   42.9       77.5   36.7   35.0   31.7   45.2
Buzzer          84.2   33.3   20.0   25.8   40.8       91.7   34.2   30.0   21.7   44.4
All Ss          86.2   43.0   34.8   34.8              87.0   41.6   36.6   31.4




















tions. The CS is designated as 0, 
regardless of whether it was the largest 
or smallest rectangle, since compari- 
sons of the gradients obtained revealed 
no essential differences attributable to 
the extensity of the CS employed; the 
other rectangles are designated as 1, 2, 
and 3 in order of decreasing similarity 
to the CS. Since the latency data 
were not representative of the results 
obtained from the other two measures, 
they are considered separately. 
Generalization gradients.—In order 
to test the hypothesis that heightened 
anxiety tends to elevate gradients of 
generalization, it is first necessary to 
demonstrate that empirical generali- 
zation gradients were obtained with 
the present procedure. Inspection of 
Tables 1 and 2 reveals the presence of 
fairly continuous decrements in per- 
centages and amplitude from the CS 
to the least similar rectangle in the 


TABLE 3 


Analysis of Variance of Amplitude Data for Normal Ss

Source of Variation        df    Mean Square        F
Anxiety levels              2       87.781         8.197*
Ss in same group           69       10.709
Stimuli                     3      364.595       187.926*
Anxiety × stimuli           6        7.855         4.049*
Pooled Ss × stimuli       207        1.940
Total                     287














* Significant beyond the 1% level of confidence. 


series. This trend is apparent in all
of the groups presented in the tables. 

The data upon which these tables 
are based were subjected to analyses 
of variance, a sample of which is pre- 
sented as Table 3. Since the same Ss 
provided the data for the response 
measures to the different stimuli, the 
procedure recommended by Kogan 
(11) and Edwards (6) for analysis of 
variance involving repeated measure- 
ments on the same Ss was utilized. 
This procedure makes use of two 
separate error estimates, so that the 
variance produced by the uncorre- 
lated subjects in the three anxiety 
groups is evaluated by using an error 
term free of correlation, while the 
other major sources of variance which 
involve the repeated performance of 
the same subjects are evaluated by 
using an error term which takes the 
correlation within Ss into considera- 
tion. It may be noted from Table 3 
that, for normal Ss, the variance in 
amplitude attributable to the four 
stimuli is highly significant. This 
finding is clearly in accord with the 
evidence for generalization derived 
from examination of the tabular data. 
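
The use of two error terms can be illustrated by recomputing the degrees of freedom and F ratios from the entries of Table 3. This is a sketch in Python under the assumption that the middle numeric column of the table gives mean squares; the dictionary keys are hypothetical labels, and the design constants (three groups of 24 Ss, four stimuli) are taken from the Method section.

```python
N_GROUPS = 3       # anxiety levels
N_PER_GROUP = 24   # Ss per group
N_STIMULI = 4      # training stimulus plus three generalized stimuli

# Mean squares as printed in Table 3 (assumed reading of the middle column).
MS = {
    "anxiety": 87.781,
    "ss_within_groups": 10.709,
    "stimuli": 364.595,
    "anxiety_x_stimuli": 7.855,
    "pooled_ss_x_stimuli": 1.940,
}

n_subjects = N_GROUPS * N_PER_GROUP
df = {
    "anxiety": N_GROUPS - 1,
    "ss_within_groups": n_subjects - N_GROUPS,
    "stimuli": N_STIMULI - 1,
    "anxiety_x_stimuli": (N_GROUPS - 1) * (N_STIMULI - 1),
    "pooled_ss_x_stimuli": (n_subjects - N_GROUPS) * (N_STIMULI - 1),
}
df_total = n_subjects * N_STIMULI - 1

# The between-Ss effect is tested against the between-Ss error term;
# effects involving repeated measures use the pooled Ss x stimuli term.
f_ratios = {
    "anxiety": MS["anxiety"] / MS["ss_within_groups"],
    "stimuli": MS["stimuli"] / MS["pooled_ss_x_stimuli"],
    "anxiety_x_stimuli": MS["anxiety_x_stimuli"] / MS["pooled_ss_x_stimuli"],
}

for name, value in f_ratios.items():
    print(f"{name}: df = {df[name]}, F = {value:.3f}")
print("total df:", df_total)
```

The component degrees of freedom sum to the total of 287, and the ratios agree with the printed F values to within what appears to be rounding of the mean squares.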

A graphic plot of the over-all gra- 
dient of response amplitude, obtained 
by pooling the data from the three 
normal groups, is shown as the lower 
solid line on Fig. 1. Comparable
results were obtained from an analysis
of the patients’ amplitude data and
the resulting gradient has been plotted 
as the lower dashed line on Fig. 1. 
The upper curves on Fig. 1 represent 
the over-all gradients for both groups 
when only the positive responses are 
employed in computing the mean 
amplitudes, zero amplitudes (corre- 
sponding to failures to react) being 
excluded from the computations. It 
may be seen that both sets of empirical 
curves have a highly similar nega- 
tively accelerated form characteristic 
of generalization gradients. The con- 
sistently lower gradient of response 
amplitude for the psychiatric group 
may be due to the large percentage of 
schizophrenic patients in that group, 
since a generally reduced level of 
responsiveness is characteristic of such 
individuals. 

FIG. 1. Empirical gradients of response amplitude for all normal and psychiatric Ss

FIG. 2. Gradients of response amplitude for the three levels of experimentally induced anxiety in normal Ss


Significant values of t obtained
between the means of Stimuli 1 and 2
and between Stimuli 1 and 3 on both
gradients revealed that the significant
amount of variation between stimuli,
noted in Table 3, could not be attrib- 
uted solely to the difference in response 
amplitude between the training stim- 
ulus and the prohibited stimuli. It 
may also be noted that the presence of 
clear-cut gradients of amplitude when 
zero amplitudes are excluded, pre- 
cludes the possibility that the results 
obtained for the amplitude measure 
might have been an artifact of the 
frequency gradients. 

Examination of Table 2 shows that 
a pooling of the percentage frequency 
of responses for all Ss yields gradients 
of frequency which closely approxi- 
mate the negatively accelerated curves 
plotted from the amplitude data. 
Analysis of variance and t tests of
these data yielded results parallel to 
those for amplitude. The results of 
these analyses indicate that when Ss 
have been trained to make a motor 
reaction to a particular visual stim- 
ulus and are then tested on other 
stimuli, to which they have been 
instructed not to respond, generaliza-
tion gradients in amplitude and fre-
quency of response are obtained.

Anxiety levels.—Table 3 indicates
that the variance in amplitude at- 
tributable to the three levels of 
experimentally induced anxiety was 
significantly greater than chance. 
The generalization gradients of re- 
sponse amplitude for the normal groups 
receiving strong shock, weak shock, 
and buzzer are presented in Fig. 2. 
Analysis of the over-all heights of the 
curves showed that the strong-shock 
group exhibited significantly more 
generalization than the other two 
groups (1% level), while the gradients 
obtained for the weak-shock and 
buzzer groups were not significantly 
different. 

Analysis of the amplitude data ob- 
tained from the psychiatric patients 
yielded results comparable to those 
obtained from the normal Ss. Inspec- 
tion of Table 1 reveals that the over- 
all gradient for the strong-shock group 
was again markedly higher than those 
obtained for the other two groups. 

The effects of induced anxiety upon 
the gradients of frequency may be 
seen in Table 2. Statistical treat- 
ments of these data indicated that 
strong shock produced a significant 
elevation of the gradient of frequency 
over the gradients obtained from the 
other two conditions in both the nor- 
mal and psychiatric groups. 

Examination of Fig. 2 suggests that 
strong shock, in addition to elevating 
the generalization gradient, tends to 
flatten it. A statistical test of this 
effect involves an evaluation of the 
differential effects of the three levels 
of experimentally induced anxiety at 
the four points along the stimulus 
dimension. This evaluation is ac- 
complished in the analysis of variance 
by the F ratio attributable to the 
interaction between anxiety level and 
stimuli, shown in Table 3 to be signifi-
cant at better than the 1% level.
Equivalent flattening of the gradient 
with increases in anxiety level was also 
found for the psychiatric Ss and was 
present in the frequency data of both 
groups. 

This finding is in accord with the 
common clinical observation that high 
levels of anxiety tend to impair dis- 
crimination, since accuracy of discrim- 
ination between stimuli should be a 
function of the steepness of the gen- 
eralization gradient rather than its 
absolute height. It should be noted, 
however, that the present findings 
may have resulted from the artificial 
maximum imposed on the measures 
of response amplitude and frequency 
used in this experiment. In terms of 
Hull’s theoretical assumptions, some 
flattening of the empirical generaliza- 
tion gradients should accompany drive 
increases whenever the observed re- 
sponse to the conditioned stimulus is 
near its physical maximum, as in- 
creases in excitatory strength would 
then tend to elevate only the responses 
to generalized stimuli. 

Measures of latency.—The median
latency of response for the five test
trials on each of the rectangles was
computed for every S. Table 4 shows
means of these individual median reac- 
tion times. An inspection of these 
data reveals that reaction latency 
tended to decrease, rather than in- 
crease, with greater disparity between 
the training and test stimuli. This 
trend would seem to indicate that 
response strength, as estimated from 
latency, tends to become greater 
as the similarity of the generalized 
stimulus to the training stimulus 
decreases. A finding to this effect is 
obviously the reverse of the generali- 
zation phenomenon obtained from the 
amplitude and frequency measures. 
However, since latency and frequency
are positively related (short mean
latencies being associated with low
mean frequencies), it is apparent that
the number of measures used in the
computation of the individual latency
scores decreased with the gradient of
frequency. This means, in effect, that
the number of indeterminately long
latencies (corresponding to failure to
react) increased with the distance
along the stimulus continuum from
the training stimulus. Had it been
possible to enter these long latencies
into the computations, the direction
of the resulting gradients would quite
probably have been the same as those
obtained for amplitude and frequency.
This explanation obviously does not
account for the fact that the latencies
which did enter into the computations
yielded a gradient which was the
reverse of generalization, rather than
assuming some other form.


TABLE 4

MEAN OF MEDIAN LATENCIES OF RESPONSE IN SECONDS
TO THE TRAINING AND GENERALIZED STIMULI

                        Normal Ss                       Psychiatric Ss
                        Rectangles                        Rectangles
Group             0     1     2     3    Mean       0     1     2     3    Mean
Strong shock     .39   .31   .34   .30   .34       .55   .42   .45   .29   .43
Weak shock       .56   .46   .24   .24   .38       .54   .27   .32   .25   .34
Buzzer           AI    .42   .34   .29   .38       .61   .60   .46   .22   .47
All Ss           .48   .40   .31   .28             .57   .43   .41   .25
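
The argument above about indeterminately long latencies can be illustrated with a small Python sketch. The latencies below are hypothetical values chosen only to show the mechanism; they are not data from this experiment, and the function names are assumptions introduced here. A failure to react is marked `None`. Dropping failures reproduces the scoring actually used, whereas entering each failure as an infinitely long latency shows what the median would look like had those trials been counted.

```python
import math
from statistics import median

# Hypothetical latencies in seconds for five test trials (None = failure to react).
trials_similar = [0.50, 0.45, 0.55, 0.40, None]     # rectangle close to the training stimulus
trials_dissimilar = [0.30, None, None, None, None]  # rectangle far from the training stimulus


def median_observed(trials):
    """Median of the latencies actually recorded; failures to react are dropped."""
    return median(t for t in trials if t is not None)


def median_censored(trials):
    """Median with each failure to react entered as an indeterminately long latency."""
    return median(math.inf if t is None else t for t in trials)


for name, trials in [("similar", trials_similar), ("dissimilar", trials_dissimilar)]:
    print(name, median_observed(trials), median_censored(trials))
```

With these made-up values the observed medians are shorter for the dissimilar stimulus, while the medians that count failures are longer for it. The sketch builds the short observed latencies into the hypothetical data, so it shows only how counting failures could reverse the direction of the gradient, not why the recorded latencies themselves were shorter.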

Since generalization gradients were 
not obtained from the latency data, it 
was not possible to use this measure 
in testing the hypothesis that amount 
of generalization is positively related 
to anxiety level. However, the effect 
of anxiety level upon over-all reaction 
latency was considered by computing 
the median latency to all test stimuli 
for each S and comparing the three 
groups. The results of this analysis 
showed that the mean latency for 
the strong-shock Ss was significantly
shorter than the mean for weak shock
at the 5% level in the normal groups. 
No other significant effects were ob- 
served as these data were extremely 
variable, particularly in the psychi- 
atric groups. 


Discussion 


It has been shown in the present 
study that when Ss have been trained 
to make a voluntary motor response to 
a particular rectangle and then tested 
on other rectangles, the strength of 
the response tendency to the test 
stimulus varies inversely with the 
distance, along a size dimension, be- 
tween that stimulus and the training 
stimulus. Voluntary responses have 
previously been employed in studies 
of generalization by Gibson (7) and 
by Brown, Bilodeau, and Baron (4). 
Both of these studies have reported 
negatively accelerated gradients of 
response frequency which were quite 
similar to the gradients of frequency 
and amplitude obtained in the present 
investigation. The empirical genera-
lization curves obtained in all three
studies have shown a close corre- 
spondence with those obtained by 
Hovland (9) and Bass and Hull (2) 
for the conditioned galvanic skin 
response. This correspondence lends 
support to the assumption that the
generalization process is similar for
both voluntary and conditioned re- 
sponses. 


In all three studies of voluntary responses, 
however, the latency gradients were in the oppo- 
site direction of those obtained from generali- 
zation of conditioned responses. This finding 
illustrates one essential difference between the 
two generalization phenomena, namely that gen- 
eralization gradients obtained in studies employ- 
ing voluntary responses result from the combined 
action of long-latency, verbally mediated re- 
sponses and short-latency, relatively unmediated 
responses. The inverted gradient of latencies is 
produced by the presence of mediated reactions 
to stimuli similar to the training stimulus result- 
ing in long mean latencies, while mediated 
response tendencies toward less similar stimuli 
result in failures to react and short mean laten- 
cies. Since the tendency to label generalized 
stimuli as “incorrect” is produced by the instruc- 
tions prohibiting responses to these stimuli, the 
mediated tendency to inhibit reactions should be 
constant for all such stimuli. Differences in the 
frequency of unmediated responses to the stimuli 
along the generalization continuum would then 
be a function of primary generalization of differ- 
ential amounts of excitatory strength, produced 
by the practice with the training stimulus. How- 
ever, a greater frequency of long-latency, medi- 
ated reactions to stimuli similar to the training 
stimulus should also be expected, since any 
implicit responses which were present when a 
positive reaction was made to the training stim- 
ulus should also generalize and serve to mediate 
reactions to similar stimuli.

In effect, this formulation suggests that the 
generalization of voluntary responses involves 
not only a decreasing primary gradient of response 
strength toward less similar stimuli, but, in addi- 
tion, a parallel gradient of mediated response 
tendencies resulting from the secondary generali- 
zation of responses to the perceptual and verbal 
response-produced cues associated with the stim- 
uli along the generalization dimension. A pre- 
diction which might be generated by this formu- 
lation is that a more conventional generalization 
gradient of latencies would be obtained if the 
tendency to mediate voluntary reactions were 
minimized by a systematic emphasis on speed. 

In general, the predictions derived from the 
relationship of the drive concept of anxiety to 
stimulus generalization were substantiated in the 
present experiment. Although the noxious stim-
uli (shocks or buzzer) were presented only three
times, relatively early in the course of a hundred 
trials, a marked difference in the level of respon- 
siveness to similar stimuli was noted between the
strong-shock and the other conditions. The dif-
ference obtained between the strong- and weak- 
shock conditions suggests that the rate of rise of 
the conditioned anxiety response is more rapid 
in situations involving strong shock. Since the 
number of anxiety conditioning trials was held 
constant, the anxiety response should have 
reached a higher maximum in the strong-shock 
condition. The fact that no difference was ob-
tained between the weak-shock and buzzer groups
suggests that the maximum reached by the 
anxiety response under these conditions was 
either zero or a small positive value. These 
results are in accord with the findings of Gwinn 
(8) that the strength of the fear response is a 
function of the intensity of the noxious stimulus. 

The present study has shown that the amount 
of stimulus generalization, as measured by the 
over-all heights of the frequency and amplitude 
gradients, is positively related to the degree of 
experimentally induced anxiety. Since similar 
elevations of generalization gradients have been 
observed with other drive states, the heightened 
responsiveness of anxious individuals may be 
regarded as one instance of the more general 
effects of strong drive states upon the generali- 
zation of response tendencies. 


SUMMARY 


The present experiment was con- 
cerned with the effects of three levels 
of experimentally induced anxiety 
upon the generalization of a voluntary 
response to visual stimuli. It was 
hypothesized that higher levels of 
anxiety would produce corresponding 
elevations of the generalization
gradients. 

Seventy-two male psychiatric pa- 
tients and an equal number of college 
students were each divided into three 
anxiety levels, effected by the use of 
two different intensities of shock and 
a buzzer. All Ss were trained to make 
a motor reaction to a visual stimulus 
and then tested on other stimuli, dif- 
fering in height from the training 
stimulus. The frequency, amplitude, 
and latency of the responses to these 
generalized stimuli were employed as 
measures of generalization. 

The results showed that the shapes 
of the generalization gradients for fre-
quency and amplitude were highly
similar in both the normal and psychi- 
atric groups. Latency of response 
yielded the inverse of a decreasing 
gradient of response strength. How- 
ever, it is suggested that a generaliza- 
tion gradient would be obtained with 
this measure if indeterminately long 
latencies were considered. 

Strong shock resulted in significantly 
higher generalization gradients than 
either weak shock or buzzer with both 
normal and psychiatric Ss. No dif- 
ferences were found between the effects 
of a weak shock and a buzzer. The 
gradient was flatter under the strong- 
shock condition than under the other 
two conditions. 

It was concluded that experimen- 
tally induced anxiety shows the ener- 
gizing function of a drive in elevating 
generalization gradients. The dis- 
crepancy between the latency data 
and the results obtained with other 
measures is discussed and interpreted 
in terms of the effects of mediated proc- 
esses upon generalization gradients 
obtained with voluntary responses. 


(Received July 16, 1952)


THE ROLE OF INSTRUCTION IN EXPERIMENTAL
SPACE PERCEPTION 1


CECIL W. MANN AND RANDOLPH O. BORING 2


Tulane University 


Perceptions and the descriptions of 
perceptions are selective processes 
depending upon a number of factors, 
some of which are mental set, train- 
ing, experience, and purpose. The 
perception of a landscape by an artist, 
a poet, a forest ranger, and a house 
builder will be different, and so will 
be their descriptions. The testimony 
given in a court of law by a witness 
answering specific questions will prob- 
ably differ from that given when he 
describes events in response to the 
unstructured request to tell what 
happened. 

In evaluating perceptual experi- 
ments it is often necessary to know to 
what extent the results have been 
influenced by selective factors. In 
order to clarify this situation, it is 
proposed to group perceptual experi- 
ments into two types. Type X will 
include those experiments in which S 
performs and E observes the per- 
formance. In the Rorschach experi- 
ment, for instance, the directions to S 
are designed to keep him at the level 
of a naive observer. It is E who is 
the observer for purposes of this 
experiment, and it is he who needs 
sophistication by training and experi- 
ence to the point where he can make 
significant observations. Similarly, in
a card-sorting experiment, S needs to
be sophisticated merely to the point
at which he knows the rules of the
game; E makes the observations and
must be trained in the selection of
relevant details for observation.


1 This report is one of a series of investiga-
tions conducted jointly with the U. S. Naval
School of Aviation Medicine under Contract
N7-onr-434 T.O.1, with the Office of Naval
Research, in cooperation with the Bureau of
Medicine and Surgery, and the Tulane Univer-
sity of Louisiana.

2 The senior author is responsible for the
article in its present form. Randolph O. Boring
modified the equipment and collected the data.

Type Y experiments are those in 
which S is the observer as well as the 
performer. He is faced with the task 
of observing the effect of the variation 
of some condition upon his perceptual 
processes. In experiments of this 
type, E retires to the position of 
mechanic who sets the conditions of 
the equipment or clerk who records. 
In measuring auditory threshold, for 
instance, E manipulates frequencies 
and intensities, S makes the observa- 
tions. The E has no subjective check 
on what S perceives; he cannot listen 
for S. His check will be objective: 
the degree of consistency of reports by 
the observer upon replication of the 
conditions. 

It is in the latter experiments that 
the selective factors operating on S 
become important. If the effect to be 
observed is complicated, ill-defined, or 
confused by other or similar effects, 
the consistency of the reports may be 
greatly reduced. Reports of the oc- 
ulogyral effects illustrate this point. 
Very few naive Ss report the obser- 
vation of all effects during and follow- 
ing rotation prior to the fifth or sixth 
rotational trial, and some Ss require 
ten or twelve trials before they are 
able to make full reports. The reports 
of the duration of the effects vary 
considerably during the first few trials, 
and thereafter increase in consistency 
with their selective perception.

The effect of selective perception is
illustrated by a series of experiments 
designed to determine the precision 
of the judgment of the vertical under 
varying conditions of visual and body 
tilt. It has been variously argued 
that under conflicting visual-proprio- 
ceptive conditions the determinants of 
the perceived vertical will be the 
visual factors (5, 6) or the proprio- 
ceptive factors (3, 4). In an attempt 
to resolve this confusion, Gibson (2) 
has recently suggested that more 
emphasis be placed upon the joint 
interaction of the factors in the visual- 
proprioceptive complex. He argues 
that when the main lines of the retinal 
field are in alignment with the direc- 
tion of the gravitational force, the 
visual factors will interact with the 
compensatory mechanism of the stato- 
cysts to produce an invariant resul- 
tant. Under this condition of con- 
sistent variables the perception of the 
vertical will be precise. This condi- 
tion is produced whether the body is 
upright or tilted, for the tilt of the 
body does not change the direction of 
gravity, albeit the force does pass 
through the body at an unusual angle. 
When the retinal framework is not in 
alignment with the gravitational force 
—because of the rotation of the visual 
field by mechanical or optical meth- 
ods, or by the addition of a centrifugal 
component to that of gravity—the 
condition is discrepant, and the visual- 
proprioceptive variables are in con- 
flict. The judgment of the vertical 
under this condition will be less precise. 


PROCEDURE 


In the design of this experiment the Tulane 
lateral-tilt chair was used with the addition of a 
small visual framework placed at eye level at a 
distance of 5 ft. in front of S. Two frameworks 
were used: (a) a luminescent straight line } in. 
wide and 14 in. in length; and (b) a luminescent 
window pattern 12 X 14 in. bisected vertically 
and horizontally. The chair and framework
could be tilted independently to produce con-
sistent or discrepant conditions. The S was
required to set a selsyn-controlled target rod, 
} X 12 in. to the direction of the gravitational 
vertical while maintained by E in the preset
consistent or discrepant condition. Three angles 
of inclination were used for chair and framework, 
30°L, 0°, and 30°R, and Ss made five judgments 
in each of the nine conditions, arranged in ran- 
dom order. The target rod was set at 30°R and 
30°L, an equal number of times in random order. 
The trials were all made in the absence of visual 
cues. The S closed his eyes while E set the con- 
ditions, and, upon instructions, opened his eyes 
and made his judgment by adjusting the target 
rod to the gravitational vertical. The S had no 
control of the tilt of the chair or the framework, 
but could move the target rod in either direction 
by the use of a small switch close to his right 
hand. The framework and the chair could be 
set by E to an accuracy of 1°; the target rod was 
geared to a recorder and could be read to an 
accuracy of .1°. 

It will be noted that under some conditions S 
has to make a choice of one of three verticals. 
For instance, when his body is tilted at 30°L and 
the visual framework at 30°R, he may (a) accept 
the tilt of his body and set the target rod in 
alignment with his body or in a position that he 
believes will compensate for the tilt, (b) accept 
the visual framework and align the target rod 
thereto, or (c) attempt to set the target rod in 
alignment with the upright walls of the room 
housing the tilt chair. Furthermore, in any 
series of trials, unless he is constantly aware of 
his criterion of verticality, he may change his 
choice from one to another. One would predict 
that naive Ss with relatively unstructured direc- 
tions would be less precise and more variable 
than Ss sophisticated with respect to the crite- 
rion. In order to test this hypothesis, the Ss— 
undergraduates ranging from 18 to 20 yr.—were 
divided into two groups with four in each group. 

Naive group.—Each member of the naive
group was brought into the darkened room, and 
was told that the problem was to set the target 
rod to the vertical. He was shown how to con- 
trol the movement of the target rod. No instruc- 
tions were given with respect to what constituted 
verticality, and if he asked questions, he was told 
to use his own judgment in setting the target rod. 

Sophisticated group.—Each member of this
group was brought into the lighted room. He 
was seated in the chair and shown how to control 
the target rod. He was told that his problem 
was to align the target rod to the gravitational 
vertical, that is, either parallel to the walls of the 
room, or at right angles to the floor. He was
then given a few trials in the lighted room in
order to make sure that he was aware of the
criterion of verticality. He was encouraged to
ask questions, and insofar as these were pertinent
to the criterion of verticality, they were answered.
Every effort was made in giving directions to
ensure that the naive group maintained its own
criterion of verticality and that the sophisticated
group accepted that of the experimenter.


RESULTS


Mean gross average and mean con-
stant errors are shown for each group
and for each visual framework in
Tables 1 and 2, respectively. It is not
proposed to discuss here the almost
negligible influence of the visual frame-
work upon the judgments of the
sophisticated group since these results
have been fully dealt with elsewhere
(1, 4). More important in this paper
is a comparison of the results of the
naive and the sophisticated groups.
It will be seen that in every compari-
son, even in those where the stimulus
variables are consistent, the errors
made by the naive group are larger
than those of the sophisticated group.


TABLE 1

MEAN AVERAGE ERRORS IN THE JUDGMENT OF THE VERTICAL FOR NAIVE AND SOPHISTICATED
GROUPS UNDER DIFFERENT CONDITIONS OF TILT OF VISUAL
FRAMEWORKS I AND II AND BODY TILT

                       Frame 30°L       Frame 0°        Frame 30°R
Position   Group        I      II       I      II       I      II
30°L       Soph.       3.0    2.4      3.4    4.6      2.0    1.8
           Naive      17.0   16.3     11.8   12.0      4.4    2.4
0°         Soph.       1.7    3.3      1.4    1.8      1.4    2.0
           Naive       7.9    7.8      2.8    2.3      8.1   10.3
30°R       Soph.       3.4    1.6      3.5    2.5      3.9    5.9
           Naive       4.7    5.4     13.6   12.0     20.0   21.4
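
The two error measures reported in Tables 1 and 2 can be made concrete with a short Python sketch. It assumes the conventional psychophysical definitions, which the text does not spell out: the average error is the mean unsigned deviation of the settings from the gravitational vertical, and the constant error is the mean signed deviation. The settings below are hypothetical, not data from the study, and the function names are introduced here for illustration.

```python
from statistics import mean

# Hypothetical target-rod settings, in degrees from the gravitational vertical
# (negative = tilted left, positive = tilted right). Not data from the study.
settings = [-4.0, 2.5, -1.5, 3.0, -2.0]


def average_error(deviations):
    """Mean unsigned deviation from the true vertical."""
    return mean(abs(d) for d in deviations)


def constant_error(deviations):
    """Mean signed deviation; the sign gives the direction of the bias."""
    return mean(deviations)


def label(ce):
    """Format a constant error with an L or R suffix, as in Table 2."""
    side = "L" if ce < 0 else "R"
    return f"{abs(ce):.1f}{side}"


print("average error:", round(average_error(settings), 1))
print("constant error:", label(constant_error(settings)))
```

Because errors in opposite directions cancel in the signed mean, the magnitude of the constant error can never exceed the average error for the same settings. A large average error paired with a small constant error therefore indicates variable rather than consistently biased judgments.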

An analysis of the constant and 
average errors of the sophisticated 
group shows homogeneity of variance 
and differences between conditions 
and subjects that are not significant. 
An analysis of the results of the naive 
group would not appear to be useful 
because of the great variability. For 
instance, two Ss made gross average
errors of more than 50°, while under
the same condition the other two Ss
made gross average errors of less than
5°. Under two-thirds of the condi-
tions represented in Tables 1 and 2,
mean values for naive Ss are mislead-
ing because of the wide range of errors.


TABLE 2

MEAN CONSTANT ERRORS IN THE JUDGMENT OF THE VERTICAL FOR NAIVE AND SOPHISTICATED
GROUPS UNDER DIFFERENT CONDITIONS OF TILT OF VISUAL
FRAMEWORKS I AND II AND BODY TILT

                       Frame 30°L        Frame 0°         Frame 30°R
Position   Group        I       II       I       II       I       II
30°L       Soph.       1.0L    2.0L     3.5L    4.4L     0.9L    1.5L
           Naive      14.3L    8.3L     8.4L   11.4L     0.5L    1.3L
0°         Soph.       1.6L    2.8L     1.0R    0.4L     0.7R    1.2R
           Naive       5.7L    6.3L     1.9R    1.8R     7.6R    9.2R
30°R       Soph.       3.4R    0.5R     1.6R    0.9R     3.1R    4.4R
           Naive       3.7R    2.6L    13.6L   10.3L    18.3R   20.3R


Discussion 


The prediction that sophisticated 
Ss will make more precise and less 
variable errors than naive Ss is borne 
out by the results. The magnitude 
of the variance of the errors of the 
sophisticated group permits the infer- 
ence that members of this group 
accepted and used the criterion of ver-
ticality demonstrated by E. The
variance of the errors of the naive 
group does not permit this inference 
to be drawn, and inspection of the 
individual results indicates that mem- 
bers of this group failed to maintain 
any single criterion of verticality, and 
that some Ss changed their criterion 
even under the same condition. 

These results throw light upon the 
discrepant results reported by Witkin 
and Asch (6) and by Mann (4) in their 
series of experiments on the judgment 
of the vertical. Witkin uses the con- 
flicting visual-proprioceptive tilt chair- 
room conditions as an unstructured 
stimulus situation in which the rela- 
tively naive S is required to choose a 
vertical. This sets up an ambiguous 
situation forcing S to make his choice 
from one or more apparent verticals 
or from the gravitational vertical. 
Under this condition, the choice is 
likely to be influenced by attitudes, 
interests, and by personality differ- 
ences. The wide range of errors and 
the significant individual differences 
reported by Witkin and Asch are 
probably a function of the level of 
instructions given S. 

In the experiments reported by
Mann, emphasis during the instruc-
tional period is placed upon the gravi- 
tational vertical. This is demonstrated 
by E, the S practices setting the gravi- 
tational vertical, and throughout the 
experimental series he is frequently 
reminded to maintain this as his crite- 
rion of verticality. The purpose of 
these experiments is to determine the 
effect of variation of visual and 
proprioceptive stimuli upon the judg- 
ment of the gravitational vertical, and 
the instructions are designed to pro- 
duce and maintain in S the direction 
of the gravitational vertical as the 
criterion of judgment. Under this 
more highly structured condition the 
situational choice is restricted, and the 
errors reported in Mann’s series are 
smaller and less variable than those 
reported by Witkin and Asch. 

The experimental series reported by 
these two investigators over the past 
few years using similar equipment and 
apparently similar design turn out to 
be different in purpose, divergent in 
method, and productive of dissimilar 
results and conclusions. A compari- 
son of the results of naive and sophis- 
ticated Ss used in this experiment 
indicates that the discrepant results 
reported in these two series of experi- 
ments are probably due to differences 
in levels of sophistication of Ss delib- 
erately and justifiably produced by 
the investigators in their attempts to 
solve two different problems by the 
use of similar equipment. 


SUMMARY 


This experiment was designed to 
test the effect of instructions upon the 
perception of the gravitational vertical.
In various positions of lateral inclina- 
tion Ss were required to set a target 
rod to the gravitational vertical against 
a visual framework. 

A naive group of Ss was instructed
without amplification to set the target
to the vertical. A sophisticated group 
was required to set the target rod to 
the gravitational vertical, was prac- 
ticed in the performance and frequently 
reminded during the experimental 
series to maintain the gravitational 
vertical as the criterion of judgment. 

The results indicate that the judg- 
ments of the naive group were less 
precise and more variable than those 
of the sophisticated group. 

It is suggested that the discrepant 
results reported by Witkin and Asch 
and by Mann may be due to different 
levels of sophistication of Ss deliber- 
ately and justifiably produced by the 
instructions given by the experi- 
menters in their attempts to solve two 
different problems. 


(Received for early publication 
August 28, 1952)


EFFECTS OF INTENSE PURE TONE STIMULI WHEN MAGNITUDE
OF INITIAL INJURY IS CONTROLLED 1


IRVING E. ALEXANDER AND FREDERICK J. GITHLER 


Princeton University


In 1890 Habermann (7) published a 
study in which, in a single case, high- 
tone hearing loss was correlated with 
injury to the basal region of the coch- 
lea. Since that time stimulation- 
deafness studies have been undertaken 
with the hope of relating the analysis 
of pitch to a place principle in the 
cochlea. Indeed, the German inves- 
tigators in the early years of this cen- 
tury became embroiled in a contro- 
versy regarding the theoretical signifi- 
cance of stimulation-deafness results. 
Their studies, which generally showed 
restricted basal end lesions for the high 
tones and much more widespread in- 
jury for the low tones, left much to be 
desired experimentally. Control of 
stimulus conditions was generally poor 
and tests of sensitivity were either 
inadequate or absent. For a thorough 
review of the literature up to 1935 the 
reader is referred to Kemp (8). 

In the last 25 years, with improved 
apparatus and better sensitivity tests, 
interest in this problem has been 
renewed. Now in addition to an 
evaluation of the effect of trauma on 
cochlear structures one may use the 
conditioned-response technique, the 
electrical potentials of the cochlea, or 
auditory nerve potentials as sensi- 


1 From the Princeton Psychological Labora- 
tory. This research was supported in part by 
the Office of Naval Research, under Contract 
N6-onr-270, T. O. 3, Project NR 140-322, and 
also in part by Higgins funds allotted to Prince- 
ton University. Permission is granted for repro- 
duction, translation, publication, use and disposal 
in whole or in part by or for the United States 
Government. This research was reported at the 
1952 meetings of the Eastern Psychological 
Association. 
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tivity measures of the state of the 
receptors, the nerve, or both. 

Despite the technical improvements 
there is still not a great deal of agree- 
ment in interpretation of stimulation 
deafness results. Some investigators 
(12, 6) have reported their findings as 
favorable to a place hypothesis, while 
others (13, 3) have been somewhat less 
optimistic. These differences some- 
times extend beyond the interpreta- 
tion of theoretical issues and one may 
find antithetical results reported. 

There are two major sources of diffi- 
culty in assessing the recent evidence 
and these perhaps are responsible for 
the apparent equivocation. In almost 
all the studies considerable differences 
in individual susceptibility to trauma 
were found. Since these studies gen- 
erally dealt with few Ss, the wide 
variability tended to obscure any par- 
ticular effects of the experimental vari- 
able, i.e., loud sound. As a conse- 
quence of this, great latitude was 
afforded in the interpretation of the 
findings. 

Secondly, most investigators have 
been concerned with structural injury 
and its correlates with sensitivity in 
the acute stage, that is, shortly follow- 
ing the exposure. When such meas- 
urements are made, one is struck with 
the apparent lack of correlation be- 
tween structural damage and sensi- 
tivity indices. Indeed, this state of 
affairs has led some investigators to 
place little faith in the measurement 
of cochlear potentials (9). The pres-
ent authors, however, have recently 
reported closer agreement between 
changes in electrical sensitivity and 
degree of structural damage when time 
is allowed for the injury process to run 
its course (2, 3). 


TABLE 1 

IMMEDIATE LOSS IN SENSITIVITY EXPRESSED IN DECIBELS 

Injury               Test Stimulus (cps) 
Stimulus   Ss        (surviving column heads: 1000, 2000) 

300 cps    762       85    88   111   112 
           766       60    61    70    63 
5000 cps   780       48    48    62    65 
           782       58    59    74    65 

(Only these rows, with four values each, are legible; their column positions are not preserved.) 

In the light of the above considera- 
tions, the aim of the present study is to 
introduce two improvements in the 
usual stimulation-deafness methodol- 
ogy with the hope of clarifying the 
existing picture. The attempt was 
made to reduce individual differences 
by inducing a known decrement in 
sensitivity. A test-retest procedure 
was employed to assess both the im- 
mediate and more chronic effects of 
intense pure tones. 


PROCEDURE 


Guinea pigs were used as Ss. Using a sterile 
technique, a lateral approach to the auditory 
bulla was made and cochlear potentials were read 
from the round window membrane. A more 
detailed report of the procedure may be found 
elsewhere (1, 13). A 1-microvolt response audio-
gram was determined for the experimental ear. 
This is a plot of the sound pressures needed to 
produce a 1-microvolt response at the various 
frequencies. The ear was then stimulated with 
one of two test tones, 300 cycles or 5000 cycles, 
at a sound pressure of 1000 dynes/cm.² These 
tones were chosen because they represent differ-
ent ends of the audiogram and because compari-
son data from acute studies were available. The 
sound was maintained long enough to induce a 
decrement of 60 db in the sensitivity to the 
exposure tone. When this decrement had been 
established, the sensitivity of the ear was again 
determined for the remainder of the audiogram. 
The incision was then sutured and the animal 
was allowed to recover. 
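
The decibel quantities in this procedure can be illustrated with a short sketch. It assumes that a loss is expressed as 20 log10 of the ratio of the poststimulation to the prestimulation sound pressure needed for the 1-microvolt response, and that the exposure level may be referred to the conventional 0.0002 dyne/cm.² reference pressure. The function names and the audiogram values are hypothetical and are not data from the study.

```python
import math

def db_ratio(pressure, reference):
    """Level of one sound pressure relative to another, in decibels."""
    return 20.0 * math.log10(pressure / reference)

def sensitivity_loss(pre, post):
    """Loss in db at each test frequency.

    pre and post map frequency (cps) to the sound pressure (dynes/cm^2)
    needed to produce a 1-microvolt cochlear response before and after
    exposure. A larger required pressure means poorer sensitivity.
    """
    return {freq: db_ratio(post[freq], pre[freq]) for freq in pre}

# Hypothetical audiogram values, for illustration only.
pre = {300: 0.01, 1000: 0.005, 5000: 0.02}
post = {300: 10.0, 1000: 5.0, 5000: 20.0}

loss = sensitivity_loss(pre, post)
print({freq: round(value, 1) for freq, value in loss.items()})

# Exposure level of 1000 dynes/cm^2 relative to the assumed
# 0.0002 dyne/cm^2 reference pressure.
print(round(db_ratio(1000.0, 0.0002), 1))
```

In this illustration each required pressure rises a thousandfold, which corresponds to the 60 db criterion decrement used in the procedure.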

When a minimum of three weeks had elapsed, 
the exposed ear was again tested for sensitivity 
to tones from 200 cycles to 10,000 cycles. A 
1-microvolt response contour was also deter-
mined at this time for the nonstimulated ear. 
This provided a limited check on the general 
effects of the initial operation on the normal ear.² 

When all measurements had been made, the 
animal was sacrificed and the temporal bones 
removed for later histological evaluation. The 
entire histological procedure has been previously 
described in detail (2, 10). 

In all, eight Ss were tested, four in each expo- 
sure-tone group, and seven were carried through 
the histology. 


RESULTS 


Let us first look at the initial effects 
of overstimulation with a 300-cycle 
and a 5000-cycle tone. Table 1 shows 
the difference in sensitivity between 
the prestimulation and poststimula- 
tion audiograms. For the 300-cycle 
stimulus injury occurs rapidly. In 
three of the four cases it was possible 
to restrict the loss at the exposure 
frequency to approximately 60 db. 
The other ear was more seriously in- 
jured. The exposure times to produce 
these decrements ranged from 3 to 6 
min. From the individual records it 
can be seen that the 300-cycle tone 
spares no part of the auditory range. 
All test tones were impaired. Of the 
three ears in which the criterion (60 
db loss) was closely met, two (759 
and 766) show similar patterns while 
the other (767) is in general less 
severely impaired. 


2 No operative effects were noted. The con-
trol ear audiograms were similar to those of the 
exposed ears before stimulation. 


FIG. 1. Composite audiograms determined 
immediately after exposure to either a 300-cycle 
or a 5000-cycle tone 

It was a much simpler task to restrict 
the sensitivity loss to 60 db for the 
exposure frequency when 5000 cycles 
was used as the stimulus, for here 
injury is more gradual. In every case 
this restriction was accomplished 
within 4 db and in a period of time 
ranging from 18 to 30 min. Again, as 
for the low-tone stimulus, all tones 
were affected by this overstimulation. 
In all cases the range of individual 
differences was restricted. 


FIG. 2. Composite audiograms determined 
three weeks after exposure to either a 300-cycle 
or a 5000-cycle tone 


The differential effects are presented 
in Fig. 1, which shows the average loss 
curve for each of the stimulus tones. 
In the 300-cycle curve the results from 
the ear injured far beyond the cri- 
terion level are not included. Taking 
each curve separately we see little 
variation in the effects of the low-tone 
stimulus between 200 cycles and 3000 
cycles. The curve for this region is 
almost flat. Sensitivity for the high 
tones, however, from 5000 cycles to 
10,000 cycles, is somewhat more im- 
paired. For the 5000-cycle stimulus 
the curve remains reasonably flat 
throughout the range with a slightly 
greater effect shown at 500 cycles and 
700 cycles. The similarity between 
the two curves for most of the range is 
marked. 


TABLE 2 

SENSITIVITY LOSS IN DECIBELS THREE WEEKS AFTER STIMULATION 

Injury                                   Test Stimulus (cps) 
Stimulus   Ss                             1000  2000  3000  5000  7000 10000 

300 cps    767      44    26    23    14    13    13    17    13     8     3 
5000 cps   768      17    20    23    25    19    19    22    16     3     7 
           780       2     1     4     9     1     2     8     4     1     4 
           782       8     1     2     6    10     2     1     3     0     3 
           783      40    43    39    41    32    37    24    24    25    21 

Incomplete rows (four surviving values each; column positions not preserved): 
300 cps    762      63     6    72    75 
           766      73    77    83    70 

This similarity disappears when we 
examine the more chronic results. 
Figure 2 depicts the average curves for 
the two stimulus tones after three 
weeks had elapsed and Table 2 con- 
tains the data from which these were 
drawn. The trend is easily seen. 
The 300-cycle curve shows a slight 
improvement only in the high-tone 
range. As a result all the points on 
the curve fall within a 13 db range. 
Throughout the range there is still a 
severe decrement in sensitivity. 

The situation has changed radically 
for the 5000-cycle ears. Improve- 
ment has been general and extensive 
for all tones. While the two curves 
retain their characteristic flatness, 
they are no longer dealing with the 
same magnitudes of loss in sensitivity. 

Average curves based on arithmetic 
means of decibel loss are subject to 
distortion when there is wide varia- 
tion in the original data. Let us, 
therefore, examine Table 2 for another 
view of the chronic results. 
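
The caution about arithmetic means can be made concrete with a minimal sketch. It uses the four complete 5000-cycle rows of Table 2 and compares the mean curve with the median curve, column by column. The use of the median is an illustrative alternative and not a procedure the authors report. One entry for ear 780 is damaged in the table as preserved and is read here as 4.

```python
from statistics import mean, median

# Losses (db) three weeks after exposure for the 5000-cycle ears,
# copied from the complete rows of Table 2 (test frequencies in
# ascending order). The third value for ear 780 is read as 4.
losses = {
    768: [17, 20, 23, 25, 19, 19, 22, 16, 3, 7],
    780: [2, 1, 4, 9, 1, 2, 8, 4, 1, 4],
    782: [8, 1, 2, 6, 10, 2, 1, 3, 0, 3],
    783: [40, 43, 39, 41, 32, 37, 24, 24, 25, 21],
}

# Regroup by test frequency, then summarize each column two ways.
columns = list(zip(*losses.values()))
mean_curve = [mean(col) for col in columns]
median_curve = [median(col) for col in columns]

for position, (m, md) in enumerate(zip(mean_curve, median_curve), start=1):
    print(position, round(m, 2), md)
```

At the lowest test frequency the mean loss is 16.75 db while the median is 12.5 db. The single heavily impaired ear (783) pulls the average above the level of the two ears that recovered almost completely.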

Of the four 300-cycle ears investi- 
gated, two (759 and 766) show no 
recovery at all while the ear which was 
so extensively damaged initially (762) 
“recovers” but is still 10 to 20 db 
poorer than the criterion level. The 
fourth ear (767) shows recovery in the 
middle- and high-tone range to within 
20 db of the prestimulation levels. 
This was the ear that was previously 
described in the poststimulation test- 
ing as least impaired in its group. 

In the 5000-cycle series, two of the 
four (780 and 782) recover completely 
to the prestimulation levels. The 
other two (768 and 783) have made 
partial recoveries and are distinctly 
better in sensitivity throughout the 
audiogram than they were immedi-
ately following stimulation. It is ap-
parent, then, that for a constant loss of 60 
db, one may expect recovery when 
5000 cycles is the stimulating tone 
while the probability of recovery is 
diminished seriously when 300 cycles 
serves as the stimulus. 


FIG. 3. Linear transformations of the state 
of the organ of Corti and the corresponding 
electrical audiograms for ears of the 300-cycle 
group. These measurements reflect the condi-
tion of the ears three weeks after the exposure. 
The animal number and percentage of damage 
accompany the organ of Corti map. 


FIG. 4. Linear transformations of the state of the organ of Corti and the corresponding elec-
trical audiograms for ears of the 5000-cycle group. These measurements reflect the condition of 
the ears three weeks after the exposure. The animal number and percentage of damage accompany 
the organ of Corti map. 

We can now turn to the histological 
findings. Figure 3 contains the effects 
of a 300-cycle injury. In each case is 
presented a linear transformation of 
the organ of Corti and an evaluation 
of its structures.³ In addition the 
electrical audiogram at the time the 
animal was sacrificed is presented for 
comparison. Of the three cases that 
were carried through the histology, 
two (759 and 762) show almost com-
plete destruction of the organ of Corti 
structures except in the first few milli-
meters of the basal turn. The electri-
cal audiograms are also most severely 
depressed. The third (767) shows 
moderate to severe injury starting at 
8 mm. and continuing almost to the 
apex. The corresponding electrical 
audiogram shows a moderate decrease 
in sensitivity for most of the range and 
almost complete recovery at 7000 
cycles and 10,000 cycles. 


3 A value of 20% was assigned to each row of 
hair cells, four in all, and the remaining 20% 
was distributed among the other organ of Corti 
structures. Damage to the auditory nerve fibers 
and ganglion cells was also assessed but is not 
reported here. 
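
The weighting scheme given in the footnote on the evaluation of the organ of Corti (20% for each of the four rows of hair cells and 20% for the remaining structures) can be sketched as a simple score. How damage within a row was graded at each location is not stated, so the fractional inputs below are an assumption made only for illustration, and the function name is hypothetical.

```python
def damage_percent(hair_cell_rows, other_structures):
    """Percentage of organ of Corti structures scored as damaged.

    hair_cell_rows: four fractions (0.0 to 1.0), one per row of hair
        cells, giving the assumed proportion of that row destroyed.
    other_structures: fraction (0.0 to 1.0) of the remaining organ of
        Corti structures destroyed.
    Each hair-cell row carries 20% and the other structures 20%.
    """
    if len(hair_cell_rows) != 4:
        raise ValueError("exactly four rows of hair cells are expected")
    fractions = list(hair_cell_rows) + [other_structures]
    if any(f < 0.0 or f > 1.0 for f in fractions):
        raise ValueError("fractions must lie between 0 and 1")
    return sum(20.0 * f for f in fractions)

# Hypothetical location: two rows of hair cells lost, supporting
# structures half destroyed.
print(damage_percent([1.0, 1.0, 0.0, 0.0], 0.5))
```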

The histological results for the 5000-
cycle stimulus are presented in Fig. 4 
along with the corresponding audio-
grams. In the two ears that com-
pletely recovered electrical sensitivity 
(780 and 782) we find rather small 
specific lesions in the region of 6 to 8 
mm. Here are specific cochlear lesions 
with normal cochlear potentials. In 
783 the main effects are found between 
6 to 10 mm. The loss in electrical 
sensitivity ranges between 21 and 43 
db and the high tones are less im- 
paired. The fourth ear (768) shows 
moderate injury from 5.4 mm. through 
15.4 mm. The pattern of injury is 
similar to the one found in 767 of the 
300-cycle group. The electrical audio- 
grams of the two are also very similar. 

As in our previous work (2, 3) when 
an average figure is used to represent 
the electrical loss and this is correlated 
with a single figure representing the 
percentage of cochlear structures miss- 
ing, the correspondence is very close. 
For the seven cases presented here the 
rank-order correlation is approxi- 
mately .9. 
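
A rank-order correlation of the kind reported here can be computed as in the following minimal sketch, which applies the product-moment formula to ranks, with tied values sharing their mean rank. The seven pairs of values are hypothetical placeholders, since the damage percentages appear only in the figures, and the function names are illustrative.

```python
def ranks(values):
    """Ranks from 1 (smallest); tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result

def pearson(x, y):
    """Product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def rank_order_correlation(x, y):
    """Spearman rank-order correlation."""
    return pearson(ranks(x), ranks(y))

# Hypothetical values for seven ears, for illustration only.
mean_loss_db = [75, 72, 20, 18, 4, 3, 33]
damage_pct = [90, 85, 30, 35, 10, 8, 25]
print(round(rank_order_correlation(mean_loss_db, damage_pct), 2))
```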


DISCUSSION 


By inducing a sensitivity loss of 
known magnitude at the exposure fre- 
quency we have succeeded in reducing 
the effect of individual variations in 
susceptibility to trauma. As a result, 
the effects of the stimulating tones 
may be more accurately gauged. To 
judge the efficacy of our procedure let 
us compare our work with previous 
results. Wever and Smith (13) also 
used a 300-cycle stimulus at a sound 
pressure of 1000 dynes/cm.² for a con-
stant interval of 4 min. This time 
interval falls within our range of 3 to 6 
min. of stimulation. Yet their acute 
results range in variability from no ap- 
parent effects to ones that reach and 
exceed our criterion. One might argue 
that our acute results at 300 cycles ap- 
pear uniform because of the magnitude 
of the sensitivity loss imposed on the 
ear. On the surface there seems to be 
some justification for this argument 
since for the most part the 300-cycle 
ears do not show much recovery. The 
results from the 5000-cycle cases, how-
ever, run counter to this notion. Here 
we have uniform effects of a severe 
nature in the acute stage and varying 
degrees of recovery, including a return 
to prestimulation levels. 

These results support the conten- 
tion that the low tones inflict greater 
damage to the cochlea than do the 
high tones. Smith (10) took this 
position on the basis of his work and 
thus postulated that the amplitude of 
displacement of the basilar membrane 
is the critical variable in determining 
the magnitude of structural damage. 
Davis et al. (6) reported findings in 
which the high tones were more effec- 
tive in causing damage. These inves- 
tigators felt that this was due to the 
damping properties of the tensor tym- 
pani muscle as it carried out its pro- 
tective function. We find this expla- 
nation somewhat tenuous in view of 
the fact that overstimulation was car- 
ried on for protracted periods ranging 
from 16 to 24 hr. a day for many days. 

With regard to auditory theories we 
find in this work, as in our noise 
studies, the weight of the evidence 
against a place hypothesis. In our 
two fully recovered ears (780 and 782) 
we find upper basal lesions which 
cover approximately 10% of the 
cochlea and no corresponding diminu- 
tion in cochlear potentials. Were we 
to be guided by the various cochlear 
localization maps that have been pro- 
posed (4, 5, 11), we should expect to 
find marked impairments of sensitivity 
in the region of 3000 cycles. That 
this is not the case is obvious from the 
sensitivity curves presented. 

In addition, we find similar his- 
tological and electrical injury when 
tones varying widely in frequency are 
used as producers of that injury (767 
and 768). Certainly, if the cochlea 
is selectively tuned in other than a very 
gross fashion, we should not expect 
such a result. A shred of evidence 
may be gathered for high-tone locali- 
zation from these same ears where there 
is little loss at the high tones and no 
injury for most of the basal turn. But 
even such evidence leaves one in a 
quandary when in the same study one 
finds cases in which basal structures 
are left intact but sensitivity loss 
extends throughout the range (759, 
762, 783). 

One further fact emerges from this 
study which we are at a loss to explain 
but which is also not commensurate 
with place theory expectations. Acute 
sensitivity impairment to a low-tone 
stimulus is greatest for the extreme 
high-tone range; i.e., 5000-10,000 
cycles. This same finding was re- 
ported earlier by Wever and Smith 
(13). 


SUMMARY 


Guinea pigs were subjected to over- 
stimulation from either a 300-cycle or 
a 5000-cycle tone for a period long 
enough to produce a 60 db loss in sen- 
sitivity for the stimulating tone. 
Cochlear potentials were taken prior 
to stimulation, immediately following, 
and after a lapse of three weeks. The 
ears were also studied histologically. 
The findings are as follows: 


1. Individual variability in suscep- 
tibility to trauma is reduced under 
these conditions of stimulation. 

2. Test-retest measurement of coch- 
lear potentials in the guinea pig is 
feasible. 

3. The acute results show that for a 
300-cycle stimulus all tones up to 5000 
cycles are affected to the same degree. 
The highest tones tested are more seri-
ously impaired. For a 5000-cycle 
tone the entire audiogram suffers an 
approximately equal loss. 

4. After three weeks some recovery 
of function takes place in all of the 
5000-cycle ears. Two of the group 
returned to normal levels. 

5. The 300-cycle injury is much 
more permanent. It appears that at 
equal sound pressures a low-tone stim- 
ulus is much more effective in produc- 
ing injury than a high-tone stimulus. 

6. Cochlear injury for a 5000-cycle 
stimulus is generally more restricted 
than for a stimulus of 300 cycles. 

7. For the cases reported here there 
exists a rank-order correlation of ap- 
proximately .9 between the average 
loss in the electrical audiogram and 
the percentage of cochlear structures 
destroyed. 

The results were discussed with ref- 
erence to a place principle in the coch- 
lea and were interpreted to be gener- 
ally unfavorable to such a principle. 


(Received July 26, 1952) 


LEARNING AND PERFORMANCE IN EYELID CONDITIONING AS 
A FUNCTION OF INTENSITY OF THE UCS¹ 


KENNETH W. SPENCE 


State University of Iowa 


Experimental studies concerned 
with the effects upon behavior of vari- 
ation of the motivational level of S 
may be grouped into three classes. 
The first class contains those experi- 
ments in which after training in the 
situation under some constant period 
of deprivation (Tp) a test or measure-
ment period is given in which Tp is 
varied. These studies are regarded as 
providing a measure of the drive factor 
(D) that determines response strength 
and the empirical relation R = f(Tp) 
is taken as a reflection of the relation 
of D to Tp. The classical obstruction 
box studies of Warden and his asso- 
ciates (19) and more recent investi- 
gations (4, 8, 12, 13, 20) in which the 
Skinner box type of situation was used 
fall into this class. 

In the second class of experiments 
Tp is varied (i.e., is different for dif- 
ferent subgroups) during the first 
training period and then a second test 
period is given in which Tp is the same 
for all groups. The objective of these 
studies has been to ascertain whether 
the learning factor (habit strength) is 
a function of the differences in drive 
strength resulting from different values 
of Tp during the first period. In- 
stances of experiments designed in 
this fashion are those of Finan (3), 
Kendler (7), and Strassburger (15). 
All of these studies used simple instru- 
mental conditioning. 

1 The writer wishes to acknowledge the assist-
ance of George Harker, Henry Loess, Wallace 
McAllister, Donald Swisher, and Janet Taylor 
who served as assistants either during prelimi-
nary experimentation or the present study. 


A third group of studies has not 
attempted to differentiate between 
the two factors of habit (H) and drive 
(D) but has simply studied perform- 
ance during the training period as a 
function of differences in Tp. Typi- 
cally these investigations have involved 
more complex learning situations such 
as the maze (9, 16, 17, 18) or discrimi- 
nation learning (2, 10). Recently, 
however, two studies of this type have 
been reported in which classical de- 
fense conditioning was employed (11, 
14). In these studies the experi- 
mentally manipulated motivational 
variable was the intensity of the un- 
conditioned stimulus (UCS) rather 
than the time of deprivation of some 
goal object. 

This third group of studies, it 
should be noted, does not permit one 
to identify which factor, learning 
(habit strength) or drive strength, 
lies behind the obtained differences in 
performance level. Quite frequently, 
however, these investigators have in- 
ferred that the acquisition process 
(habit formation) was superior under 
conditions of strong motivation. 
Thus, Passey (11) interpreted his 
finding that the level of conditioning 
was a positive function of the inten- 
sity of the UCS as supporting Hull’s 
assumption (5) that more habit 
strength was developed with a stronger 
drive. He pointed out that this was 
in line with Hull’s special reinforce- 
ment hypothesis that related the incre- 
ment of habit strength per trial to the 
amount of drive reduction or drive 
stimulus reduction. 

While it is true that this finding is in 
agreement with the implication of 
Hull’s hypothesis, it should be noted 
that there are two other possible inter- 
pretations of it even within the frame- 
work of Hull’s theorizing. Thus, 
according to this formulation, response 
strength is a function of both habit 
and drive strength, i.e., R = f(H × D). 
It is conceivable that the increments 
of habit strength (H) do not vary with 
the intensity of the UCS and Tp but 
that only drive strength (D) does. 
Finally, there is the possibility that 
differences in the intensity of the UCS 
or in Tp might result in differences in 
both of these hypothetical factors, H 
and D. The experimental design em- 
ployed by Passey does not permit one 
to make any decision among these 
alternatives. 
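
The three alternatives can be illustrated with a minimal sketch of the multiplicative formulation, taking f as the identity so that R = H × D. The sketch assumes, for simplicity, that H is fixed by the UCS used in an earlier training period and D by the UCS present at the time of measurement. All numerical values and names are hypothetical.

```python
def response_strength(habit, drive):
    """Multiplicative combination R = H x D (f taken as the identity)."""
    return habit * drive

def predicted_cells(habit_by_training, drive_by_test):
    """Predicted response strength for each (training UCS, test UCS) pair.

    "W" and "S" stand for the weak and strong UCS.
    """
    return {
        (train, test): response_strength(habit_by_training[train],
                                         drive_by_test[test])
        for train in ("W", "S")
        for test in ("W", "S")
    }

# Hypothetical parameter values for the three alternatives.
habit_only = predicted_cells({"W": 0.4, "S": 0.8}, {"W": 1.0, "S": 1.0})
drive_only = predicted_cells({"W": 0.6, "S": 0.6}, {"W": 0.5, "S": 1.0})
both = predicted_cells({"W": 0.4, "S": 0.8}, {"W": 0.5, "S": 1.0})

for name, cells in (("habit only", habit_only),
                    ("drive only", drive_only),
                    ("both", both)):
    print(name, cells)
```

When both periods use the same UCS, as in a design with a single training and measurement condition per group, only the (W, W) and (S, S) cells are observed, and all three sets of values place the strong group above the weak group.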


The appropriate experimental design with 
which to investigate this problem is the factorial 
type shown later in Table 1. The experiment is 
conducted in two periods. In the first the Ss are 
divided randomly into two groups, one of which 
is conditioned with a weak UCS and the other 
with a strong UCS. In the second period these 
two groups are each divided into two subgroups 
with half of the Ss continuing with the same UCS 
as in the first period and half being switched to 
the other intensity. There are thus four differ- 
ent subgroups in this second experimental period. 

Response measures obtained in the second 
period provide the data. Comparison of the row 
means indicates whether response strength dur- 
ing this period varies with the level of intensity 
of the UCS present in the first period. As the 
level of intensity of UCS in the second measuring 
period is controlled, this comparison cannot 
reflect differences in drive strength (D) resulting 
from a difference in intensity of UCS. Hence, a 
significant difference in these row means would 
be interpreted as indicating that a different 
amount of habit strength developed in the two 
groups during the first period. An accompany- 
ing significant difference in the column means 
would be interpreted as reflecting differences in 
drive strength (D) resulting from the different 
intensities of UCS present at the time of meas- 
urement. Presumably such a difference would 
also reflect differences in habit strength that 
would develop during the second period. If, on 
the other hand, there is no difference in the row 
means and a significant difference in the column 
means, this would lead to the interpretation that 
habit growth was not a function of the motiva- 


tional level of the organism and that the difference 
between the groups under different intensities of 
UCS reflected differences in drive strength (D) 
only. The absence of a significant difference in 
the column means and a significant difference in 
the row means would imply that the difference 
in response strength reflected different habit 
strengths with no difference in drive strength. 
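
The comparisons described above can be sketched as follows, assuming equal subgroup sizes so that row and column means may be taken directly from the four cell means. The cell values are hypothetical, the function names are illustrative, and the final branch of the interpretation (no difference in either set of means) is added for completeness and is not discussed in the text.

```python
def marginal_means(cells):
    """Row means (first-period UCS) and column means (second-period UCS).

    cells maps (first_period, second_period) to a cell mean.
    """
    rows, cols = {}, {}
    for (first, second), value in cells.items():
        rows.setdefault(first, []).append(value)
        cols.setdefault(second, []).append(value)
    row_means = {k: sum(v) / len(v) for k, v in rows.items()}
    col_means = {k: sum(v) / len(v) for k, v in cols.items()}
    return row_means, col_means

def interpret(rows_differ, columns_differ):
    """Interpretation of significant row and column differences."""
    if rows_differ and columns_differ:
        return "differences in both habit strength and drive strength"
    if rows_differ:
        return "different habit strengths with no difference in drive strength"
    if columns_differ:
        return "differences in drive strength only"
    return "no difference attributable to intensity of the UCS"

# Hypothetical cell means (number of CR's), for illustration only.
cells = {("W", "W"): 6.0, ("W", "S"): 10.0,
         ("S", "W"): 8.0, ("S", "S"): 12.0}
row_means, col_means = marginal_means(cells)
print(row_means, col_means)
print(interpret(rows_differ=True, columns_differ=True))
```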

Attention should be called to the fact that the 
studies in the first two of our three classes of 
motivational experiments conform partly in 
design to the factorial type described above ex- 
cept that they have been incomplete designs. 
Thus, the investigations in the first group had 
but a single motivational condition in the first 
training period and two or more different moti- 
vational conditions in the second period. In 
terms of Table 1 there was but a single row and a 
number of columns consisting of but a single cell. 
Studies in the second group, on the other hand, 
have typically had but a single column with a 
number of different rows each with but a single 
cell. Only the Kendler study involved the com- 
plete factorial design. 

The partial nature of the experimental design 
employed in these investigations has resulted in a 
certain amount of distortion in their findings. 
Thus, in the first class of studies, those concerned 
with differences in drive strength (D), the typical 
procedure has been to carry on the original train- 
ing in the setup with a deprivation interval of 
approximately 24 hr. Subsequent comparisons 
of response strength at different deprivation 
periods, e.g., 3, 6, 12, and 24 hr., do not make 
proper allowance for the fact that the 3-, 6-, and 
12-hr. groups are performing at different condi- 
tions from those employed in the first period 
while the 24-hr. group has the same deprivation 
condition. Interpreted in terms of our theo- 
retical constructs H and D, this design provided 
assurance that a constant amount of H was 
developed in the groups to be run subsequently 
under the different deprivation intervals. It 
did not, however, control for the possibility that 
a significant part of the stimulus complex that 
acquires habit strength for the response consists 
in the internal cues from the need state (Sp). If 
these cues are discriminably different for the dif- 
ferent deprivation periods, and there is experi- 
mental evidence (1, 6) to indicate that they are, 
then the stimulus complex for any groups run 
under a different deprivation condition in the test 
period from that used in the original training 
period is changed. This would mean that the 
effective habit strengths available for the differ- 
ent groups during the test periods would not be 
constant but would differ from each other in 
varying amounts that would be a function of the 
amount of change in the stimulus complex 
(principle of stimulus generalization). So far as 
providing the precise nature of the function relating 
D to Tp, these studies have been worthless 
because they have not controlled for these dif-
ferences in generalized habit strength.² 


The present experiment attempted 
to investigate the question as to 
whether differences in response 
strength in classical defense condition- 
ing resulting from the use of different 
intensities of the UCS are to be inter- 
preted as reflecting, in part at least, 
differences in the strength of condi- 
tioning, i.e., habit strength (H), or 
whether the differences reflect only 
differences in drive strength (D). 
The experimental design employed 
was the factorial type discussed above. 


PROCEDURE 


Subjects.—A total of 97 men from an intro-
ductory course in psychology served as Ss. 
Women were not used as it was found in an ex- 
ploratory experiment that a very high propor- 
tion of them responded with voluntary blinks to 
the CS under the condition in which the UCS 
(air puff) was very strong (5.0 lb./sq. in.). 
Eleven men were eliminated because of the fact 
that they gave a very high incidence of voluntary 
closures to the conditioned light signal. Such 
voluntary responses were distinctly different from 
conditioned responses both in their form and in 
their latency distribution. Thus, the voluntary 
pattern of response, consisting of a smooth, sharp 
closure which was maintained until after the 
onset of the air puff, occurred almost exclu- 
sively in the period 200 to 460 msec. following the 
onset of the CS. The responses predominating 
in the period 460 to 755 msec. were more gradual 
and much more irregular in their pattern. All 
Ss who had a high incidence (one-third or more) 
of responses with latencies less than 460 msec. in 
either one of the two conditioning periods were 
eliminated from the experiment. Questioning 
of these Ss revealed that there was not always 
awareness that such “voluntary” closures were 
taking place. Six Ss were eliminated from the 
experiment because of error on the part of E or 
apparatus difficulties. Eighty Ss remained for 
the experiment described below. 


2 Yamaguchi’s study is an exception to this in 
that in it the time of deprivation employed in the 
original training period was the same as that 
used in the test period. In effect Yamaguchi 
took advantage of the Strassburger and Kendler 
findings that Tp did not affect the amount of H 
developed in the first period. 
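
The elimination criterion for voluntary responders can be sketched as below. The sketch assumes that incidence is computed over recorded responses (not over trials) and that a latency of exactly 460 msec. falls in the later period; neither point is specified in the text. The function names are hypothetical.

```python
def classify_latency(latency_ms):
    """Assign a response latency (msec. after CS onset) to a period."""
    if 200 <= latency_ms < 460:
        return "voluntary-form period"
    if 460 <= latency_ms <= 755:
        return "conditioned-response period"
    return "outside both periods"

def high_early_incidence(latencies_ms):
    """True if one-third or more of the responses fall below 460 msec."""
    if not latencies_ms:
        return False
    early = sum(1 for t in latencies_ms if t < 460)
    # Integer comparison avoids rounding at exactly one-third.
    return 3 * early >= len(latencies_ms)

def eliminated(day1_latencies, day2_latencies):
    """An S is dropped if either conditioning period meets the criterion."""
    return (high_early_incidence(day1_latencies)
            or high_early_incidence(day2_latencies))

# Hypothetical latency records for one S.
print(eliminated([520, 610, 700], [250, 300, 640]))

# Subject accounting given in the text: 97 tested, 11 and 6 eliminated.
print(97 - 11 - 6)
```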

Apparatus and method of recording.—The S 
was seated in a dental chair in a semidarkened 
room adjoining that in which the recording appa- 
ratus and stimulus controls were located. Fixed 
in a reclining position in the chair, S was in- 
structed to blink on receiving a ready signal, and 
then to fixate on a 6-in. circular milk glass disc, 
placed at a distance of 132 cm. The brightness 
of the disc between trials was .05 apparent ft.- 
candles. 

The CS consisted in an increase in brightness 
of the disc to 1.51 apparent ft.-candles. The 
duration of the CS on each trial was 825 msec. 
The onset and duration of the CS and UCS were 
controlled by means of Hunter electronic timers 
and were recorded on a Brush polygraph by 
means of a Brush BL-902 penmotor. 

The UCS, a puff of air, followed the onset of 
the CS by 755 msec. and was delivered to the 
right eye. The source of the air puff was a high 
pressure compressed air line provided in the 
building. Appropriate pressure was obtained by 
means of a variable (0-10 lb./sq. in.) air pressure 
reducing valve. A pressure gauge was used to 
provide the two strengths of air pressure em- 
ployed, .25 lb./sq. in. in the case of the weak UCS 
and 5.0 lb./sq. in. in the case of the strong UCS. 
A 110-v., 60-cycle AC operated solenoid valve, 
controlled by an electronic timer, limited the 
duration of the puff to 50 msec. This represents 
a much superior air-puff system to that previ-
ously employed in this laboratory in that the air 
pressure can be maintained at the stated pressure 
for any desired time interval. 

The movement of the eyelid was recorded by 
a system similar to that described earlier by 
Spence and Taylor (14) except that a simpler 
method of amplification was employed. Briefly 
the technique involved the use of (a) a light- 
weight microtorque potentiometer, (b) a direct 
current amplifier, (c) a Brush polygraph, and (d) 
a Brush BL-902 penmotor. 

The microtorque potentiometer was mounted 
on an adjustable headband just above S’s eye. 
A very light lever was attached at right angles to 
the variable (rotating) arm of the potentiometer 
and a mechanical linkage was accomplished 
between it and a tiny plastic lever fastened to 
the upper eyelid by adhesive tape. A small 
restoring spring was added to the potentiometer 
arm thus permitting it to follow the eyelid’s 
motion. The system was very light and involved 
a minimum of work. Movement of the eyelid, 
with its accompanying rotation of the potentio- 
meter arm, altered the DC voltage appearing on 
the grids of a push-pull vacuum tube amplifier. 
The latter was a three stage push-pull amplifier 
with a push-pull cathode follower output stage 
being used to match the impedance of the Brush 
penmotor. 

Conditioning procedure.—The experiment was 
conducted in two periods, the second following 
the first by 24 hr. Following the reading of 
instructions each S received three presentations 
of the light alone in order to check for any initial 
conditioned response tendency to the light. A 
single air puff was then administered and the 
eyelid behavior recorded for 40 sec. All Ss were 
then given 30 paired presentations of the light 
and air puff. Intertrial intervals of 15, 20, or 
25 sec., averaging 20 sec. and arranged according 
to a fixed schedule, were used. 

On this first day half of the 80 Ss were condi- 
tioned with the relatively weak air puff (.25 lb./ 
sq. in.) and half with the strong puff (5.0 lb./sq. 
in.). At the beginning of Day 2 the instructions 
were repeated and a further 70 paired presenta- 
tions of the CS and UCS were given. On this 
day half of the Ss trained with the weak puff on 
Day 1 were trained for 70 more trials with it and 
half were shifted to the strong UCS. Similarly, 
half of the Ss trained on Day 1 with the strong 
UCS had the strong UCS continued on Day 2 and 
half were shifted to the weak UCS. Thus, on 
Day 2 there were four subgroups of 20 Ss each 
in terms of the UCS used in the two conditioning 
periods: WW, WS, SS, and SW. 

At the end of the session all Ss were asked not 
to discuss the experiment with other members of 
the class. 


RESULTS AND DISCUSSION 


The values in the four cells of 
Table 1 represent the mean number 
of CR’s made by the four subgroups 
in the first 20 trials of the second day. 
This 20-trial period was chosen be- 
cause a longer period ran into the 
problem that our measuring scale has 
a ceiling, i.e., 100%. The frequency 
measure of response strength is quite 
satisfactory only so long as we do not 
reach this ceiling level. Once an S 
has reached 100% any further in- 
crease in strength of the S—R tendency 
(sER) with further training cannot be 
reflected by this measure. By the 
second ten-trial block two Ss were 
responding 100%, and in the next ten- 
trial block four more Ss reached this 
level. Accordingly, it was decided to 
use the data for the first 20 trials only. 


The scores obtained in this period 
provided quite satisfactory distribu- 
tions so far as the requirements (vari- 
ance and form) for statistical analysis 
are concerned. Score distributions 
for longer periods were quite unsatis- 
factory, for with the ceiling on the 
measure the distributions became 
more and more skewed as more Ss 
attained the 100% level of response. 

Returning to Table 1, it will be 
seen that there are sizable differences 
both in the case of the row means and 
the column means. The statistical 
significance of these variations was 
tested by means of analysis of vari- 
ance, the results of which are pre- 
sented in Table 2. It will be noted 
that there was no significant inter- 
action. The “main effect” of the 
intensity of the UCS on Day 1, as 
shown by the difference between the 
two row means, is revealed to be sig- 
nificant by the F test at a level just 
short of the .01 level of confidence. 
The difference between the column 
means, which reflects the “main effect” 
of the intensity of UCS at the time of 
measurement (Day 2), is also seen to 
be highly significant by the F test, in 
this instance beyond the .001 level of 
confidence. 
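
The arithmetic behind these F ratios can be retraced from the cell means of Table 1 and the within-groups mean square of Table 2. The following is a minimal sketch, assuming equal cells of 20 Ss; the variable names are illustrative and not part of the original report. Because the cell means are rounded to two decimals, the interaction term may differ slightly from the tabled value.

```python
# Cell means from Table 1 (mean CRs, first 20 trials of Day 2),
# keyed by (Day 1 UCS, Day 2 UCS).
cell_means = {
    ("weak", "weak"): 5.65,
    ("weak", "strong"): 8.80,
    ("strong", "weak"): 7.45,
    ("strong", "strong"): 13.00,
}
n_per_cell = 20      # Ss in each of the four subgroups
ms_within = 28.11    # within-groups mean square from Table 2

levels = ("weak", "strong")
grand = sum(cell_means.values()) / 4
row_mean = {r: sum(cell_means[(r, c)] for c in levels) / 2 for r in levels}
col_mean = {c: sum(cell_means[(r, c)] for r in levels) / 2 for c in levels}

# Each effect has 1 df, so its sum of squares is also its mean square.
ss_rows = 2 * n_per_cell * sum((row_mean[r] - grand) ** 2 for r in levels)
ss_cols = 2 * n_per_cell * sum((col_mean[c] - grand) ** 2 for c in levels)
ss_inter = n_per_cell * sum(
    (cell_means[(r, c)] - row_mean[r] - col_mean[c] + grand) ** 2
    for r in levels
    for c in levels
)

f_rows = ss_rows / ms_within    # Day 1 intensity (rows)
f_cols = ss_cols / ms_within    # Day 2 intensity (columns)
f_inter = ss_inter / ms_within  # rows x columns interaction
print(round(f_rows, 2), round(f_cols, 2), round(f_inter, 2))
```

Each F is then compared with the critical values given in the footnote to Table 2 for 1 and 76 df.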

The significant difference between 
the row means supports the interpre- 
tation, it will be recalled, that different 
strengths of conditioning (amounts of 


TABLE 1 

Mean Number of CR’s Made in First 
20 Trials of Day 2 

                        Day 2 UCS (lb./sq. in.)      Means
Day 1 UCS                                         Reflecting H
(lb./sq. in.)              .25        5.00
   .25                     5.65        8.80           7.23
  5.00                     7.45       13.00          10.23
Means Reflecting D         6.55       10.90

TABLE 2 

Analysis of Variance for Trials 31-50 
(Day 2) 

Source                           df    Mean Square      F*
Intensity Day 1 (rows)            1       180.00        6.40
Intensity Day 2 (columns)         1       378.45       13.46
Rows X columns (interaction)      1        28.85        1.03
Within groups                    76        28.11
Total                            79

* .05 level, F = 3.98; .01 level, F = 7.01. 


habit strength) were developed during 
the first day of training when the 
UCS was different for the Ss com- 
pared. This result would be in line 
with the notion that the increment of 
habit strength (AH) is a function of 
the strength of drive and interpretable 
as in agreement with the drive-reduc- 
tion theory of reinforcement; that is, 
the stronger UCS would produce a 
stronger drive and with its cessation 
there would be a greater drive reduc- 
tion. 

The significant difference between 
the column means indicates that 
response strength varied with the in- 
tensity of the UCS employed at the 
time of measurement. Presumably 
this difference reflects the differences 
in drive strength (D) produced by the 
different intensities of UCS. How- 
ever, if habit strength also varies with 
intensity of UCS, then there must also 
be a difference between the habit 
strengths of the two groups (.25 and 
5.0). This difference in habit strength 
would be that which developed during 
this 20-trial period of Day 2 for they 
were equated in terms of the amount 
of habit strength developed on Day 1. 
While no evaluation can be made of 
the differences, it is of some interest 
that the difference between the column 
means (4.4) is 47% larger than the 


difference between the row means 
(3.0). The former involves only a 
difference in H based on 30 trials with 
different intensities of UCS. 

A problem that arises in connection 
with this interpretation of a different 
amount of habit strength being built 
up with the different intensities of 
UCS is the negative evidence that has 
been obtained in studies employing 
an appetitional need such as hunger. 
Thus, the experiments of Kendler (7) 
and Strassburger (15), which involved 
varying the time of deprivation of 
some goal object (food, water), did not 
provide comparable significant dif- 
ferences in the row means of the 
response measures in the second pe- 
riod. An answer that suggests itself 
is that the amount of reinforcement 
(drive reduction) in the food reward 
situation is the same for different con- 
ditions of deprivation. Indeed, the 
reinforcement probably is secondary 
in nature and does not involve any 
primary drive reduction whatever. 
In the conditioning study, on the 
other hand, the reinforcement was 
primary (pain reduction) and, pre- 
sumably, different for the different 
intensities of air puff. 


There are, of course, other possible interpre- 
tations as to the factor responsible for the differ- 
ence in the row means, i.e., the difference related 
to the intensity of the UCS on Day 1. One 
most likely to be offered by the theorist opposed 
to any kind of reinforcement notion is that a 
bigger response is probably conditioned in the 
case of the stronger UCS. The logic is not too 
clear as to how one gets from this to the pre- 
dicted greater frequency of CR for the group 
which is making the larger unconditioned 
response (UCR) although it is evident that a 
higher average magnitude of CR would be ex- 
pected. Presumably it is assumed that the 
magnitude of the CR increases with successive 
trials and there is thus a greater likelihood of a 
response reaching the criterion magnitude (1 
mm.) of a CR earlier in the case of Ss with a large 
UCR than those with a smaller one. An exam- 
ination of the magnitude of the UCR during the 
first ten trials of Day 1 did reveal that the mean 
UCR was larger for the 5.0-lb. group than for 
the .25-lb. group (18.17 mm. vs. 14.5 mm.). 
However, an examination of the scatter diagrams 
relating amplitude of UCR on these trials with 
the number of CR’s on Day 2 (Trials 31-50) 
showed no evidence of a positive correlation. In 
fact the relationship was slightly but insignifi- 
cantly negative. 

As a further means of checking on the possi- 
bility that amplitude of UCR was a factor, Ss in 
the four subgroups were matched by selecting 
ten Ss in each subgroup whose mean amplitude 
of UCR on Day 1 fell between 14 and 20 mm. 
The mean amplitudes of the four subgroups when 
so matched were either 17.3 or 17.4 mm. The 
mean number of CR’s in Trials 31-50 on Day 2 
for the WW and WS subgroups and for the SW 
and SS subgroups were then calculated for these 
Ss. They were 6.5 for the Ss (WW and WS) 
who had the weak UCS on Day 1 and 11.5 for 
the Ss (SW, SS) who had the strong UCS. The 
difference, 5.0, is even larger, it will be noticed, 
than that which appears between the row means 
in Table 1. 

A second possible interpretation of this finding 
that would not involve the reinforcement prin- 
ciple is that a stronger emotional or fear reaction 
was established to the cues from the experimental 
situation in the case of the more intense UCS. 
As the conditioned fear response is one of the 
determiners of the drive level (D), it would follow 
that Ss trained with the more intense UCS would 
be expected to have a higher D level. According 
to this formulation, however, the different levels 
of response are due to this different level of D 
(from the secondary motivation of fear) and not 
to a difference in H. The present experiment 
does not, of course, permit an evaluation of these 
rival interpretations. Its chief contribution is 
the experimental finding that performance level 
in the second period is a function of the differen- 
tial conditions in the first period. This fact 
points to the necessity for invoking some kind of 
historical or learning factor, different habit 
strength, conditional fear responses, or some such 
to account for it. 


SUMMARY AND CONCLUSIONS 


A factorial type experiment was 
conducted to investigate the question 
as to whether a difference in the inten- 
sity of the UCS (air puff) during a 
first training period would lead to dif- 
ferences in the level of eyelid condi- 
tioning in a second period in which 
intensity of the UCS was controlled. 


During the first experimental period 
of 30 trials 80 Ss, divided randomly 
into two equal groups, were condi- 
tioned. One group was conditioned 
with a weak UCS and the other with a 
strong UCS. In a second training 
period given on the following day the 
two groups were each divided into two 
subgroups with half of the Ss continu- 
ing with the same UCS and half being 
switched to the other intensity. 

Analysis of variance of the fre- 
quency of CR’s in the first 20 trials 
of the second period revealed that 
there were significant differences in 
performance in relation both to the 
intensities of UCS employed in the 
first period and also in relation to that 
used in the second period. The fact 
that response level varied with the 
intensity of the UCS employed on 
Day 1 was interpreted as in accord 
with Hull’s theory that the increment 
of habit strength is a function of the 
amount of primary drive reduction. 
The finding that the level of CR varied 
with the intensity of the UCS at the 
time of measurement on Day 2 was 
taken as a reflection of difference in 
drive level (D). Other possible inter- 
pretations not involving the principle 
of reinforcement were also suggested. 


(Received July 21, 1952) 
PERCEPTION OF THE STATISTICAL STRUCTURE OF A RANDOM 
SERIES OF BINARY SYMBOLS 1 


HAROLD W. HAKE AND RAY HYMAN 
The Johns Hopkins University 


Our problem was to learn how 
human Ss perceive the statistical 
structuring imposed upon apparently 
random series of binary symbols. The 
series used were similar to those pre- 
viously used in studying the effects of 
partial reinforcement in human verbal 
learning in that they were chance 
series constructed by following proba- 
bility rules determining the appear- 
ance of either symbol on each trial 
(2, 3, 6). 

It has been shown that Ss can learn 
the probability of occurrence of one 
of two possible symbols in such a 
chance series and will predict that 
symbol about as frequently as it is 
likely to occur in the series (2). These 
results seem to us to be convincing 
evidence that Ss must be responding 
to whole series of trials rather than to 
a single trial as an isolated unit. The 
stimulus for responding on each trial 
must be a composite of whatever 
experiences they have had on previous 
trials. This is certainly the case in 
human verbal learning of the type 
originated by Humphreys where the 
stimulus on each trial is the series of 
trials preceding it (4). The stimulus 
on a given trial differs from the stim- 
ulus on any other trial only in its 
temporal position in the trial series. 

We chose this experimental situa- 
tion, therefore, as a good one in which 
to learn how Ss perceptually group 


1 This work was supported under Contract 
N5-ori-166, Task Order I, between the Systems 
Coordination Division, Naval Research Labora- 
tory, Office of Naval Research, and The Johns 
Hopkins University. This is Report No. 166-I- 
160, Project Designation No. NR 507-470, under 
that contract. 


together the events occurring in a 
series of trials as the stimulus for 
action on a succeeding trial. By 
introducing sequential dependencies 
into some of our trial series, we varied 
the likelihood, from series to series, of 
a given trial being preceded by a par- 
ticular combination of symbols. We 
used four trial series which differed 
from each other in the probability of 
occurrence of a particular stimulus on 
any given trial (first-order probabil- 
ity), and differed from each other also 
in the degree to which the appearance 
of one of two possible symbols on a 
particular trial depended upon which 
symbol had occurred on the preceding 
trial (second-order probability). The 
degree to which the predictions of Ss 
were determined by such first- and 
second-order statistical structuring in 
the trial series was analyzed by a 
method derived from communication 
theory (8). 


METHOD 


Apparatus.—The symbol which Ss saw on 
each trial was either a vertical column or a hori- 
zontal row of four small lighted neon bulbs. The 
sequences in which these two symbols appeared 
were punched in code on teletype tape. This 
tape was read by an electronic sequence pro- 
grammer which presented all of the trials auto- 
matically. Each trial consisted of a warning 
light followed 4.5 sec. later by one of the two 
symbols. The intertrial interval was held con- 
stant at 9 sec., and the stimuli remained on for 
1.7 sec. 

Procedure.—All Ss saw a series of 240 symbols 
presented one after the other. Immediately 
following the warning light, and before the sym- 
bol appeared, on each trial, Ss were required to 
predict which of the two possible symbols would 
occur on that trial. They made this prediction 
by checking either H or V on answer sheets, 


TABLE 1 

The Experimental Series 

              Series 1   Series 2   Series 3   Series 4
P(H)            .50        .50        .75        .75
P_H(H)          .50        .80        .75        .90
P(V)            .50        .50        .25        .25
P_V(V)          .50        .80        .25        .70


They were given no information about the rules 
by which the series were constructed. 

Four groups of Ss were used, each of which 
predicted just one series. The four series are 
summarized in Table 1. In Series 1 the H 
symbol occurred on one-half of the trials. This 
was also true of Series 2 which had the additional 
statistical restriction that whenever an H had 
occurred, the probability that it would be 
followed by another H was .80. Series 3 and 4 
had H symbols occurring on three-quarters of the 
trials. Series 4 had the additional sequential 
probabilities shown in the table. All series were 
constructed following these probability rules 
which applied to the V symbol also as shown in 
the table. They were constructed so that the 
rules held in each successive group of 24 trials. 
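
The probability rules just described can be illustrated with a short sketch. It assumes a plain two-state chain in which each symbol repeats with the probabilities shown in Table 1 (read here as P_H(H) and P_V(V)); the function names are hypothetical. Unlike the original tapes, this sketch samples freely and does not force the rules to hold exactly within each block of 24 trials.

```python
import random


def stationary_p_h(p_hh, p_vv):
    """Long-run proportion of H implied by the two repetition probabilities."""
    return (1 - p_vv) / ((1 - p_hh) + (1 - p_vv))


def make_series(p_hh, p_vv, n_trials=240, seed=0):
    """Sample a series of 'H'/'V' symbols from a two-state first-order chain."""
    rng = random.Random(seed)
    # Draw the first symbol from the long-run proportion of H.
    series = ["H" if rng.random() < stationary_p_h(p_hh, p_vv) else "V"]
    while len(series) < n_trials:
        prev = series[-1]
        repeat = rng.random() < (p_hh if prev == "H" else p_vv)
        other = "V" if prev == "H" else "H"
        series.append(prev if repeat else other)
    return series


# (P_H(H), P_V(V)) for Series 1-4, as read from Table 1.
SERIES_RULES = {1: (0.50, 0.50), 2: (0.80, 0.80), 3: (0.75, 0.25), 4: (0.90, 0.70)}

for number, (p_hh, p_vv) in SERIES_RULES.items():
    series = make_series(p_hh, p_vv, seed=number)
    print(number, round(stationary_p_h(p_hh, p_vv), 2), series.count("H") / len(series))
```

The stationary proportions show why the sequential rules of Series 2 and 4 are compatible with first-order probabilities of .50 and .75 respectively.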

Subjects.—A total of 40 Ss was used, ten in 
each group. They were men and women under- 
graduate students and were run in groups of two 
or three at a time. 


RESULTS 


The data of this study are the series 
of 240 predictions made by each S. 
In each series, every prediction except 
the first was made following the occur- 
rence of some set of events on previous 
trials. We have analyzed these pre- 
dictions to show the degree to which 
each was determined by these pre- 
vious events as a function of the num- 
ber of previous trials being considered 
and the number of different kinds of 
events which were considered to have 
occurred on those trials. 

Description of the uncertainty meas- 
ure.—Our basic measurement is the 
uncertainty measure which has been 
described fully elsewhere (8, 9, 10, 11). 
The computations of the uncertainty 
or entropy associated with a series of 


predictions of an S came from: 

    H_i(j) = -\sum_{i,j} p(i)\, p_i(j) \log_2 p_i(j),    (1) 

where H_i(j) is our uncertainty in stat- 
ing which symbol S will predict on a 
given trial when we know the previous 
events described by (i); p_i(j) is the 
proportion of trials on which S pre- 
dicted a given symbol when he had 
experienced the previous events (i); 
and p(i) is the proportion of trials 
before each of which he had experi- 
enced events (i). Since the predic- 
tions of our Ss always involved the 
choice of one symbol from a set of 
two, maximum uncertainty is unity 
and minimum is zero. 
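
Equation (1) can be computed directly from tallies of previous events and the predictions that followed them. The sketch below is one way to do so, assuming the data are supplied as two parallel lists; the function and argument names are illustrative only.

```python
from collections import Counter
from math import log2


def conditional_uncertainty(contexts, responses):
    """Uncertainty in bits about the response when the context (i) is known.

    contexts[k] is the set of previous events preceding trial k, and
    responses[k] is the symbol predicted on that trial.
    """
    n = len(responses)
    context_counts = Counter(contexts)
    joint_counts = Counter(zip(contexts, responses))
    total = 0.0
    for (context, _response), n_ij in joint_counts.items():
        p_i = context_counts[context] / n             # p(i)
        p_j_given_i = n_ij / context_counts[context]  # p_i(j)
        total -= p_i * p_j_given_i * log2(p_j_given_i)
    return total


# With no usable context, two equally frequent predictions give the maximum.
print(conditional_uncertainty(["none"] * 4, ["H", "V", "H", "V"]))
# When the context fixes the prediction, the uncertainty falls to the minimum.
print(conditional_uncertainty(["a", "b", "a", "b"], ["H", "V", "H", "V"]))
```

The two calls illustrate the limits stated above for a choice between two symbols.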
This measure has the useful prop- 
erty that 
    H_i(j) \geq H_{i'}(j),    (2) 


where H_{i'}(j) is an uncertainty measure 
based upon the same series of predic- 
tions as is equation (1) but where the 
previous events (i') are more com- 
pletely known. Expression (2) states 
that our uncertainty H_{i'}(j), which 
results from knowing more about what 
S has experienced at the time he 
makes his choice, is less than or equal 
to our uncertainty H_i(j), which re- 
sults from knowing less about what 
he has experienced at the time he 
makes his choice. The two uncer- 
tainties can be equal only in the case 
where increased knowledge of pre- 
vious events does not decrease our 
uncertainty. When increased knowl- 
edge of previous events which S has 
experienced does decrease our uncer- 
tainty in stating which symbol he will 
predict on the next trial, then we 
assume that these events were dis- 
criminated by S and did act to deter- 
mine his choice. We have used this 
relationship to determine the amount 
by which our uncertainty, in stating 
which symbol an S will predict on a 
given trial, is reduced as we define 


TABLE 2 

The Average Uncertainties Resulting 
When the Joint Events of 
Previous Trials Are Known 

Average uncertainty   Series 1   Series 2   Series 3   Series 4
H(y)                    1.00       1.00       .76        .80
H_xy(y)                  .98        .54       .68        .55
H_{xy,xy}(y)             .95        .52       .66        .54


more and more explicitly the events 
occurring previous to his prediction. 
We have expressed this amount as an 
average value holding for an entire 
group of ten Ss which predicted a 
particular series. Only the last 120 
trials are considered in this analysis. 

Effect of previous events on the pre- 
dictions of subjects.—We assumed that 
an exhaustive set of the events which 
preceded each prediction of each S on 
a particular trial included the symbols 
which S had predicted on previous 
trials and the symbols which had 
appeared on those trials. If only the 
events occurring on the preceding trial 
are considered, there are, then, four 
experiential states in which S could 
be at the beginning of a trial. These 
are (H,h), (H,v), (V,h), and (V,v), 
where the capital letter denotes the 
symbol predicted by S on the preced- 
ing trial and the small letter denotes 
the symbol which appeared. Each of 
these states has associated with it the 
computed probability that S will be 
in that state at the beginning of a 
trial.2 Using this probability and 
the computed probability that S 
would predict a particular symbol 


2 For example, p(H,h) is computed from 
p(H,h) = N_{H,h}/N 


where N_{H,h} is the number of trials on which S 
correctly predicted the H symbol and N is the 
total number of trials. 


when he was in each of these experiential 
states,3 we computed H_xy(y) as 
our average uncertainty in stating 
which symbol S will predict when we 
know what he predicted on the pre- 
ceding trial and we know which symbol 
actually appeared. These quantities, 
computed separately for each group 
of Ss, are given in line (2) of Table 2. 
On line (1) the average uncertainties 
are presented which result when the 
events of the previous trial are ignored, 
and on line (3) the average uncer- 
tainties are presented which result 
when the events of the previous two 
trials are considered. Consideration 
of the events occurring on the preced- 
ing three trials did not further decrease 
our uncertainty appreciably. 
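
As a hedged illustration of how such quantities might be tallied, the sketch below computes the average uncertainty about a prediction from one S's prediction series and the symbol series, conditioning on the predictions, the symbols, or both, over a chosen number of preceding trials. The function name, its arguments, and the short example series are hypothetical; the original analysis was restricted to the last 120 trials, which a caller could reproduce by slicing both lists.

```python
from collections import Counter
from math import log2


def uncertainty(predictions, symbols, use_pred=False, use_sym=False, lag=1):
    """Average uncertainty (bits) about S's prediction on a trial, given the
    selected events on the `lag` preceding trials.

    With both flags False the result is H(y); use_pred alone conditions on
    previous predictions, use_sym alone on previous symbols, and both flags
    together on the joint events.
    """
    contexts, responses = [], []
    for t in range(lag, len(predictions)):
        context = tuple(
            (predictions[k] if use_pred else None, symbols[k] if use_sym else None)
            for k in range(t - lag, t)
        )
        contexts.append(context)
        responses.append(predictions[t])
    n = len(responses)
    context_counts = Counter(contexts)
    joint_counts = Counter(zip(contexts, responses))
    # -sum p(i) p_i(j) log2 p_i(j), with p(i) p_i(j) = n_ij / n.
    return -sum(
        (n_ij / n) * log2(n_ij / context_counts[ctx])
        for (ctx, _response), n_ij in joint_counts.items()
    )


# Made-up example: an S who always predicts the symbol that just appeared.
symbols = list("HVVHHVHVVHHHVH")
predictions = ["H"] + symbols[:-1]
print(uncertainty(predictions, symbols))
print(uncertainty(predictions, symbols, use_sym=True))
print(uncertainty(predictions, symbols, use_pred=True, use_sym=True, lag=2))
```

Because every call uses the same trials for a given lag, the values can be compared in the manner of expression (2).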

Knowledge of these joint events 
which occurred on previous trials in 
our study always decreases our uncer- 
tainty in stating which symbol S is 
likely to predict on a given trial. 
This is especially true of Ss predicting 
Series 2 and 4 which are the two series 
containing sequential dependencies, 
but it is true also to a lesser degree of 
Ss predicting Series 1 and 3 which 
contained no sequential dependencies. 
Knowledge of previous events pro- 
duces little reduction in our uncer- 
tainty about Ss predicting Series 1. 
This is not surprising as the series 
itself is chaotic and provides no clues 
as to its future behavior. 

We next determined the degree to 
which knowledge of the separate 
events occurring on previous trials 
decreased our uncertainty. We com- 
puted H_y(y) which is our average un- 
certainty in stating which symbol S 
will predict when we know only what 
he predicted on the previous trial. 


3 The probability, p_{H,h}(H), is computed from 
p_{H,h}(H) = N_{H,h,H}/N_{H,h} 


where N_{H,h,H} is the number of trials on which S 
predicted the H symbol after correctly predicting 
it on the previous trial. 
The behavior of the symbol series is 
ignored here. These quantities are 
given in Table 3 along with H(y), our 
average uncertainty resulting when 
previous predictions are ignored, and 
H_{y,y}(y), our average uncertainty 
resulting when the previous two pre- 
dictions of Ss are considered. With 
the exception of Ss predicting Series 
1, our uncertainty in stating what Ss 
will predict on a given trial is reduced 
in every case when we know only what 
they have predicted on previous trials. 
The Ss predicting Series 3 appear to 
be most responsive to the predictions 
made on previous trials. Since this 
symbol series contained no sequential 
dependencies, we could expect that 
these Ss would not find important 
clues about the future behavior of the 
series in the past behavior of the series 
itself. They may have looked for 
such clues instead in the previous pre- 
dictions which they had made. This 
was unjustifiable since the symbol 
series was entirely determined in ad- 
vance and independent of the pre- 
dictions of Ss. The discriminatory 
response of Ss to previous responses 
which they had made has been reported 
before, however, in a situation where 
the memory of previous responses was 
the only discriminable stimulus pres- 
ent on each trial (10). 

We should expect Ss predicting 
Series 2 and 4, which contained sequen- 
tial dependencies, to be more respon- 
sive to the past behavior of the symbol 


TABLE 3 

The Average Uncertainties Resulting 
When the Previous Predictions 
of Subjects Are Known 

Average uncertainty   Series 1   Series 2   Series 3   Series 4
H(y)                    1.00       1.00       .76        .80
H_y(y)                  1.00        .83       .72        .75
H_{y,y}(y)              1.00        .83       .72        .73


TABLE 4 

The Average Uncertainties Resulting 
When the Previous Symbols, Appear- 
ing in the Symbol Series, Are 
Known 

Average uncertainty   Series 1   Series 2   Series 3   Series 4
H(y)                    1.00       1.00       .76        .80
H_x(y)                   .99        .69       .74        .70
H_{x,x}(y)               .98        .55       .70        .56


series itself. This notion was tested 
by computing two new quantities. 
These are H_x(y), our uncertainty in 
stating what S will predict when we 
know only which symbol had ap- 
peared on the previous trial, and 
H_{x,x}(y), our uncertainty when we 
know only which symbols had ap- 
peared on the previous two trials. 
The previous predictions of Ss are 
ignored here. The symbols which 
had occurred previous to a given trial, 
of course, are always constant for all 
Ss predicting the same series. These 
quantities are presented in Table 4 
and approximate more closely those of 
Table 2. The particular symbols 
which had appeared on preceding 
trials were more important in deter- 
mining predictions on succeeding 
trials than were the predictions which 
Ss had made on preceding trials. This 
is especially true of Ss predicting sym- 
bol series with sequential dependencies. 

We have described three classes of 
events occurring on previous trials 
which influence the predictions which 
Ss make on the next succeeding trial. 
These three classes of events are the 
predictions which Ss made on previous 
trials, the symbols which appeared on 
previous trials, and the joint occur- 
rence of these two events on previous 
trials. We have shown that our 
knowledge of each of these classes of 
events decreases our uncertainty in 
stating which symbol S will predict on 


TABLE 5 

The Average Uncertainties Resulting 
When the Correctness of Previous 
Predictions Is Known 

Average uncertainty   Series 1   Series 2   Series 3   Series 4
H(y)                    1.00       1.00       .76        .80
H_c(y)                  1.00        .99       .74        .78
H_{c,c}(y)              1.00        .89       .70        .76


the next trial. The greatest decrease 
in our uncertainty occurs when we 
know the joint occurrences of events 
on preceding trials (Table 2). It 
would be convenient and interesting 
if we could state the proportion of this 
maximum decrease in uncertainty 
which results when we know only 
either which symbols S predicted or 
which symbols appeared on preceding 


TABLE 6 

The Average Uncertainties of 
the Symbol Series 

Average uncertainty   Series 1   Series 2   Series 3   Series 4
H(x)                    1.00       1.00       .81        .80
H_x(x)                  1.00        .78       .80        .57
H_{x,x}(x)               .98        .77       .79        .56


trials (Tables 3 and 4). We cannot 
do so for two reasons. 

1. It is apparent from the tables 
that the amount by which our uncer- 
tainty is reduced by knowledge of the 
joint events of previous trials is not 
equal to the sum of the amounts by 
which it is reduced by knowledge of 
each of the two separate classes of 
events. It can be shown that this 
will never be true so long as the 


Fig. 2. The proportion of trials, in successive 24-trial blocks, on which the symbol H was 
predicted when it had been predicted on the preceding trial 


occurrences of these separate events on 
previous trials are not separately and 
independently determined (11). We 
have shown that they are not inde- 
pendently determined since the pre- 
dictions made by S are influenced by 
the symbols which have appeared 
(Table 4). 

2. We have found that the events 
of previous trials which we have de- 
scribed so far are not exhaustive of the 
events which can occur on those trials, 
as S views them. In the case of three 
groups, our uncertainty is reduced if 
we know only whether previous symbol 
appearances were correctly predicted. 
We need not know which symbol was 
correctly anticipated. These uncer- 
tainties, H_c(y) and H_{c,c}(y), are given 
in Table 5. 


Our average uncertainty in stating 


which symbol will appear in the symbol 
series on a given trial when we know 
which symbols have appeared on pre- 
vious trials is given for each series in 
Table 6. In every case the symbol 
series was less statistically struc- 
tured than were the predictions of 
Ss, as shown by the uncertainties of 
Table 2. 
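
For comparison, the uncertainties implied by the generating rules themselves can be worked out directly. The sketch below assumes each series is a two-state first-order chain with the repetition probabilities of Table 1 (read here as P_H(H) and P_V(V)); the function names are illustrative. Values computed from the series actually presented, as in Table 6, need not match these theoretical figures exactly.

```python
from math import log2


def binary_entropy(p):
    """Uncertainty in bits of a two-way choice with probabilities p and 1 - p."""
    if p in (0.0, 1.0):
        return 0.0
    return -(p * log2(p) + (1 - p) * log2(1 - p))


def series_uncertainties(p_hh, p_vv):
    """Return (H(x), H_x(x)) implied by the repetition probabilities."""
    p_h = (1 - p_vv) / ((1 - p_hh) + (1 - p_vv))  # long-run proportion of H
    h_x = binary_entropy(p_h)
    # Weight the uncertainty after each symbol by how often that symbol occurs.
    h_x_given_prev = p_h * binary_entropy(p_hh) + (1 - p_h) * binary_entropy(p_vv)
    return h_x, h_x_given_prev


# (P_H(H), P_V(V)) for Series 1-4, as read from Table 1.
for number, rule in {1: (0.50, 0.50), 2: (0.80, 0.80), 3: (0.75, 0.25), 4: (0.90, 0.70)}.items():
    h_x, h_x_given_prev = series_uncertainties(*rule)
    print(number, round(h_x, 2), round(h_x_given_prev, 2))
```

For a series without sequential dependencies the two values coincide, since the preceding symbol then carries no information about the next one.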

Changes in prediction trends through- 
out the trial series.—The conclusions 
which we have made, of course, are 
based upon averaged data of the last 
120 trials. Actually, each S did not 
respond in the same way every time 
he experienced a particular succession 
of events. Following the occurrence 
of some events the predictions of Ss 
were indeterminate and showed no 
systematic trends. Following other 
events, there were striking systematic 
changes in the predictions made. This 
was especially true of the predictions 
which followed trials on which a sym- 
bol had been correctly predicted. 

In Fig. 1 the proportion of trials, 
p(H), in each successive 24-trial block 
on which the symbol H was predicted 
is plotted for all prediction series. 
These curves are similar to those re- 
ported before for “acquisition” trials 
in verbal conditioning (2). These 
curves do not show that the sequential 
dependencies of Series 2 and 4 had 
any effect upon the predictions of our 
Ss. The predictions of Ss predicting 
Series 1 and 2 are identical, as are 
those of Ss predicting Series 3 and 4. 
The predictions of the two 50% groups 
are significantly different from the two 
75% groups (p < .01). 

In Fig. 2 only those H predictions 
which follow an H prediction are 


plotted. In both of these figures it is 
apparent that the statistical character- 
istics of the prediction series resemble 
the characteristics of the symbol series. 

A different result is obtained, how- 
ever, when only those H predictions 
which follow a correct H prediction 
are plotted. In Fig. 3 we have plotted 
p_{Hc}(H), the proportion of correct H 
predictions which were followed by an 
H prediction. The predictions of the 
groups predicting Series 2, 3, and 4 
are now almost indistinguishable. It 
appears that the predictions of the 
group watching Series 4 are somewhat 
more determined in the early trials, 
but by the halfway mark Ss in all 
three groups appear to respond in 
about the same way once they have 
experienced a trial on which they had 
correctly anticipated the H symbol. 
This provides the interesting result 
that Ss who do not predict H an equal 
number of times over a series of trials 
(Series 2 and 3, Fig. 1) are in agree- 
ment as to exactly when it should be 
predicted. The difference between 
the results of Fig. 1 and Fig. 3 can be 
explained, of course, by the fact that 
Ss watching Series 2 and Ss watching 
Series 3 do not correctly anticipate 
the H symbol with equal frequency. 
This frequency is determined most 
importantly by the statistical charac- 
teristics of the symbol series. 
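
The block-by-block proportion plotted for correct H predictions could be tallied along the following lines. This is only a sketch under stated assumptions: each trial is assigned to the 24-trial block in which the following prediction falls, a block with no correct H prediction to follow is reported as None, and the function name is hypothetical.

```python
def p_h_after_correct_h(predictions, symbols, block_size=24):
    """Per block: proportion of correct H predictions followed by an H prediction."""
    proportions = []
    for start in range(0, len(predictions), block_size):
        end = min(start + block_size, len(predictions))
        opportunities = 0
        followed_by_h = 0
        # Trial 0 has no preceding trial, so it can never follow a correct H.
        for t in range(max(start, 1), end):
            if predictions[t - 1] == "H" and symbols[t - 1] == "H":
                opportunities += 1
                followed_by_h += predictions[t] == "H"
        proportions.append(followed_by_h / opportunities if opportunities else None)
    return proportions


# Made-up example with a block size of 4 for brevity.
example_predictions = list("HHHVHHVH")
example_symbols = list("HHVVHVHH")
print(p_h_after_correct_h(example_predictions, example_symbols, block_size=4))
```

Averaging such per-S proportions within a group would give one point per block for that group's curve.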

The transitional probability, p_{Hc,Hc}(H), 
for having correctly predicted H 
twice in succession and then predicting 
H again is presented in Fig. 4. The 
Ss predicting Series 2, 3, and 4 are 
still in fair agreement as to when H 
should be predicted. The Ss watch- 
ing Series 1, the chaotic series, how- 


ever, now show a systematic tendency 
to increase the likelihood of predicting 
H following two correct anticipations 
of that symbol. This upward trend 
is statistically significant as shown by 
a test for trend (p < .025) (1). 

Not all sequences of previous events 
are such adequate stimuli for succeed- 
ing predictions. There is a tendency, 
rather, for Ss to respond most syste- 
matically to homogeneous runs of the 
same symbol, as has been reported 
before (5). The Ss of every group, 
however, showed highly systematic 
response tendencies following at least 
some previous events. This means 
that the statistical characteristics of 
the series of predictions of Ss were 
actually not in agreement with the 
characteristics of the symbol series 
they were predicting. It is true, as 
has been reported previously (2), that 
the proportion of trials on which a 
symbol was predicted was about equal 
to the proportion of trials on which 
that symbol appeared. When the pre- 
dictions are analyzed sequentially, 
however, it is apparent that the pre- 
dictions of our Ss were far more deter- 
mined following some symbol 
sequences than were the symbols of 
the series themselves. 


DISCUSSION 


An important result of this study is 
the evidence it provides to answer the 
question of how human Ss apparently 
can perceive statistical structuring in 
a sequence of events and reproduce 
this structuring in their own series of 
responses. We conclude from our 
evidence that Ss do not, in fact, per- 
ceive statistical structuring as such. 
That is, they do not perceive the 
probability rules by which the series 
of events was generated. They do per- 
ceive, instead, those short sequences 
of events which precede each predic- 
tion, which can be discriminated from 
other possible sequences, and which 
are found to provide some information 
about the future behavior of the sym- 
bol series. There are several inter- 
esting conclusions which we can make 
about the way in which Ss perceive 
these previous events. 

1. Not all combinations of possible previous
events were discriminated with equal
ease. Some previous
events, especially homogeneous runs 
of the same symbol, were more easily 
discriminated and consistently re- 
sponded to than were others. 

2. The previous events to which our 
Ss responded on each trial included 
more than just the symbols which had 
been appearing. They included also 
the previous predictions of Ss and the 
degree of correspondence between
their predictions and the symbols
which appeared on previous trials. 

3. There was considerable agree- 
ment among our Ss as to when a 
particular symbol should be predicted. 
They tended to respond to some sim- 
ilar or identical previous events in the 
same way, no matter which series 
they were predicting (Fig. 3 and 4). 

The fact that the prediction series 
of Ss, after the first 120 trials, showed 
about the same statistical characteris- 
tics as the series being predicted 
(Fig. 1 and 2) is due to the interaction 
of the symbol series and these consist- 
ent ways in which Ss responded to 
some previous events. The effect of 
the symbol series being predicted is 
that of partially determining how 
often Ss could experience certain pre- 
vious events and of partially deter- 
mining the degree of relationship 
existing between these previous events 
and the future behavior of the symbol 
series. The statistical characteristics 
of the series of predictions of Ss are 
then at least partially determined by 
these two variables: (a) the number of 
times that S is given an opportunity 
to respond in a consistent way to dis- 
criminable sequences of previous 
events, and (b) the degree of success S 
has in predicting future symbol beha- 
vior by responding consistently to 
discriminable sequences of previous 
events. The statistical characteristics
of the response series of Ss are,
then, the result of these stimulus- 
response relationships and not the 
cause of them. That is, Ss do not 
first perceive the rules by which the 
series were constructed and then 
respond accordingly. 

We believe that this notion is most 
in accord with our data and accounts 
for two factors which have not been 
explained previously. First, it has 
been noted that Ss do not act to 
maximize correct predictions by consistently
predicting the symbol whose
probability of occurrence is greater 
than .50 (2). Second, it has been 
reported that Ss can make a par- 
ticular response a given proportion of 
the time without counting their pre- 
vious responses and without being 
aware that they are responding accord- 
ing to any statistical rule (10). The 
explanation for both of these results 
lies in the fact that Ss do not act like 
maximizing functions or like statis- 
tically governed mechanisms at all. 
The statistical characteristics of their 
responses are only by-products of the 
stimulus-response relations we have 
suggested. 
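
The cost of not maximizing can be shown with a short worked calculation. The sketch assumes a series in which successive symbols are independent and one symbol appears with probability p, and it assumes that S's predictions are statistically independent of the symbols. The function names and the value p = .75 are illustrative assumptions, not figures from this study.

```python
def expected_accuracy_matching(p: float) -> float:
    """Expected proportion correct when S predicts the more frequent symbol
    on a proportion p of trials, independently of which symbol appears."""
    return p * p + (1 - p) * (1 - p)


def expected_accuracy_maximizing(p: float) -> float:
    """Expected proportion correct when S always predicts whichever
    symbol is the more frequent one."""
    return max(p, 1 - p)


# Illustrative value only: one symbol appears on 75 per cent of trials.
p_illustrative = 0.75
matching = expected_accuracy_matching(p_illustrative)      # 0.625
maximizing = expected_accuracy_maximizing(p_illustrative)  # 0.75
```

Under these assumptions, matching the proportion of predictions to the proportion of occurrences yields fewer correct predictions than always predicting the more frequent symbol, whenever p differs from .50.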

The results of this study and this 
explanatory notion have important 
implications for the concept of infor- 
mation which is derived from infor- 
mation theory. It has been stated 
that the informational value of the 
occurrence of one event from a set of 
events is a function of the likelihood 
of occurrence of that event (11). The 
likelihood of occurrence here is defined 
by the statistical rules by which this 
event is chosen from the set. This 
concept of information has had impor- 
tant applications where only physical 
systems are involved which select 
symbol after symbol, according to 
statistical laws, for transmission to a 
receiving system. When a human 
observer becomes the receiver of such 
a message, composed of a series of 
symbols chosen in that way, however, 
we can no longer specify the informa- 
tional value of the message to the 
observer by stating the statistical laws 
by which it was generated unless the 
degree to which the observer expects each 
symbol before it occurs conforms exactly 
to these statistical laws. We have 
shown, essentially, that this condition 
does not hold. It does not hold because
the observer does not directly
perceive the rules by which a series of
events is generated. That is, he does 
not perceive an apparently random 
series as being homogeneously ran- 
dom, but responds instead as though 
the series were composed of small sub- 
sequences, some of which are depend- 
able cues to the future behavior of the 
series. 
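
The distinction drawn here can be made concrete with a small sketch. It assumes the usual logarithmic measure, in which the information carried by an event of probability p is -log2(p) bits. The function name and the two probabilities used are hypothetical, chosen only to show that the value computed from the generating rule and the value computed from the observer's expectancy need not agree.

```python
import math


def surprisal_bits(probability: float) -> float:
    """Information, in bits, conveyed by the occurrence of an event that
    was expected with the given probability (assumed measure: -log2 p)."""
    if not 0 < probability <= 1:
        raise ValueError("probability must lie in the interval (0, 1]")
    return -math.log2(probability)


# Hypothetical values: the series generates H with probability .50, but
# after some run of previous events the observer expects H with .80.
bits_by_generating_rule = surprisal_bits(0.50)     # 1.0 bit
bits_by_observer_expectancy = surprisal_bits(0.80)  # about 0.32 bit
```

In this invented case the same occurrence of H carries one bit when reckoned from the generating rule, but only about a third of a bit when reckoned from what the observer expected.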

Furthermore, the observer does not 
predict the future behavior of a series 
of events purely on the basis of the 
past behavior of the series. The 
observer contaminates his judgment 
by including his own previous beha- 
vior and the “correctness” of his own 
previous behavior in the set of events 
which influences his expectancy of 
future events. 

This last point is important, we 
believe, because it means, if our Ss are 
typical, that Ss will always perceive 
an ambiguous series of events as being 
more structured than it really is. This 
must follow whenever Ss allow their 
own past behavior to influence their 
predictions of the future occurrence of 
series of events which are, in fact, 
completely independent of the be- 
havior of Ss. 


SUMMARY 


The Ss were required to predict on 
each of 240 trials which symbol of two 
possible symbols, H or V, would 
appear on that trial. Each of four 
groups of Ss predicted a different series 
of symbols. The four symbol series 
differed from each other in terms of 
the proportion of trials on which a 
particular symbol appeared and in the 
degree of sequential dependency exist- 
ing between successive symbols. 

The results were as follows: 


1. The Ss in all groups began pre- 
dicting the two symbols about equally 
often and gradually adjusted their predictions
to conform to the actual probability
of occurrence of the symbols in
the symbol series. The Ss tended also 
to predict a symbol, following a pre- 
vious prediction of that symbol, about 
as often as the symbol succeeded itself 
in the symbol series. 

2. The predictions of Ss were 
sequentially analyzed to determine 
the degree to which their predictions 
on a given trial were determined by 
the events occurring on preceding 
trials. The analysis showed that the 
specific stimuli to which Ss responded 
on each trial were the sequences of 
events, including the previous predic- 
tions of Ss and the correctness of these 
predictions, which had occurred on 
the previous two trials. Some se- 
quences of previous events, especially 
homogeneous series of a correctly pre- 
dicted symbol, were shown to be more 
adequate stimuli than were others. 

3. The apparent ability of Ss to
perceive and reproduce sequential 
statistical structuring was explained 
in terms of response to specific se- 
quences of events occurring prior to 
each trial. The relevance of these 
results to the formulation of informa- 
tion transfer was discussed. 


(Received for early publication 
September 29, 1952) 